9 February 2018 News

The Lord of the Rings by J. R. R. Tolkien and Foundation by Isaac Asimov: SPbU mathematicians have analysed the texts of the world's best sellers

Scientists at St Petersburg University and the Institute for Intelligent Information Processing of ORT Braude College, Israel, have proposed a new approach to the computational analysis of text authorship and style. The approach is based on modelling the dynamic process of text creation.

This approach enabled the scientists to analyse the works of J. R. R. Tolkien, Isaac Asimov, Arthur C. Clarke and many other renowned authors by exploring how their individual styles changed over the years. The findings of the research group's recent study have been published in Pattern Recognition, a journal published by Elsevier.

The paper was written by SPbU postdoc Konstantin Amelin, Candidate of Science in Physics and Mathematics; SPbU Professor Oleg Granichin; Natalya Kizhaeva, an aspirantura programme student at the SPbU Department of System Programming; and Zeev Volkovich, PhD, Head of the Institute for Intelligent Information Processing and Dean of the Computer Faculty at ORT Braude College, Israel.

The mathematicians selected several well-known works of literature: Foundation, a cycle of seven science fiction novels by Isaac Asimov; The Forsyte Saga, a series of works by John Galsworthy; The Lord of the Rings, a novel in three volumes by J. R. R. Tolkien; and other books. In their previous papers they had already analysed the works of J. K. Rowling (the Harry Potter series). The researchers are interested in large bodies of text that an author has produced over time: the mathematical approach makes it possible to see how the author's individual style has changed.

One can work with big data using traditional methods, i.e. classification and searching for related elements, similarities or groups. We introduced a new big data analysis algorithm and suggested exploring the way the data was being created.

SPbU Professor Oleg Granichin, Doctor of Science in Physics and Mathematics

"Any text was written, spoken or otherwise recorded by someone. This process also has its own particular characteristics, which manifest themselves, for example, in the author's individual style. Today, we do not just study what data looks like, but reveal the characteristics of the process of creating it. So far no one has analysed texts this way," Oleg Granichin noted.

In their paper the researchers compared the three books of The Lord of the Rings by J. R. R. Tolkien with his other works, namely The Hobbit and The Silmarillion. The method determined quite accurately that The Hobbit was written by the same author who had created the trilogy, yet The Silmarillion differs greatly in style. This is because the book was published after the author's death: the collection of myths and legends of Middle-earth was completed by Christopher Tolkien, the son of John Tolkien, who had been studying his father's drafts for several years.

"There are notable differences in style even within the works of one author," Natalya Kizhaeva adds. "For instance, the fourth book of the Foundation cycle was written by Isaac Asimov almost 30 years after the third one had been completed, at the insistence of his fans. Our method allowed us to divide the seven books of the series into two clusters: those created before 1953 and the ones written after 1982. Over the 30 years the author changed, as did his environment, his vision of life and, as a consequence, his style of writing."

The staff of the SPbU Research Laboratory for Analysis and Modelling of Social Processes are working on other projects that cut across the humanities and the exact sciences. In July 2016, using a unique technology of manuscript analysis, they showed that the manuscript Al-Khitat ("Description of Egypt") kept at the University of Michigan was very likely to be an original work of the famous Egyptian historian Al-Maqrizi. Until then, it had been considered a copy.

The method presented in the paper models the dynamic process of text creation. Its basic data are not only sequences of individual symbols in the text and in words, but also sequences of n-grams (contiguous strings of n symbols). For example, if n = 3, instead of the six symbols of "_mama_", the computer programme extracts the following trigrams from the text: "_ma", "mam", "ama", "ma_". The document is then divided into sub-documents that form an ordered sequence of n-gram occurrences, and a relation is sought between each sub-document and its "neighbours". For this, methods developed earlier in signal processing theory are used, which identify frequency characteristics in data sequences. The new method thus determines the individual "frequency characteristics" of an author's style, by analogy with the frequencies of physical waves recorded by special-purpose devices.
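The n-gram step can be illustrated with a short Python sketch. It is not the researchers' implementation: the function names and the fixed-size chunking rule are assumptions made for illustration, and the signal processing analysis of neighbouring sub-documents is not reproduced here. The sketch only shows how character trigrams are extracted and how a text can be turned into an ordered sequence of per-sub-document n-gram counts.

```python
from collections import Counter


def char_ngrams(text: str, n: int = 3) -> list[str]:
    """Return every contiguous run of n characters, in reading order."""
    return [text[i:i + n] for i in range(len(text) - n + 1)]


def split_into_subdocuments(text: str, size: int) -> list[str]:
    """Cut the text into consecutive chunks of `size` characters.

    Fixed-size chunking is an assumption for illustration; the article
    does not say how the sub-documents are delimited.
    """
    return [text[i:i + size] for i in range(0, len(text), size)]


def ngram_profiles(text: str, size: int, n: int = 3) -> list[Counter]:
    """Build one n-gram count table per sub-document, kept in document order."""
    return [Counter(char_ngrams(chunk, n))
            for chunk in split_into_subdocuments(text, size)]


# The example from the article, with "_" standing for a word boundary.
print(char_ngrams("_mama_"))  # ['_ma', 'mam', 'ama', 'ma_']
```

The ordered list returned by `ngram_profiles` is the kind of sequence in which each sub-document could then be compared with its neighbours.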

The authors of the algorithm are planning to test the methodology on works of Russian literature, since it can be applied to texts in other languages written in the Latin alphabet, the Cyrillic alphabet or Arabic script.

The researchers note that their invention can help in analysing not only literary works but also unstructured texts. For example, the method will be useful for processing the data streams arriving at operator consoles or at customer service call centres. The Israeli colleagues apply the invention to detect artificially generated texts, written not by a person but by a machine. For instance, there are programmes that fabricate texts resembling real scientific papers, and such texts are sometimes accepted for publication in well-known journals. The method makes it possible to distinguish such articles from human-written texts with greater accuracy.
