

“After our agents began producing thousands of pages of content at rapid rates we reached a point where we had to limit how much we used AI editing tools. The human workers could no longer keep up with the speed of output generated by machines — there was simply no way to manage that much output fast enough to meet demand from publishers, advertisers, etc. The result was to develop a way to incorporate neural network proofing verification methods into our edit process; therefore reducing time spent proofing by 95%. We went from reviewing 4 hours per article to only 10 minutes per article; no magic involved — just basic math principles. That’s why we added contextual proofing checks directly into the edit workflow. Anyone who does any level of written communication for a living will agree that manually inserting commas and periods into sentences when they were written today appears equally absurd as writing computer software using Notepad.”
The purpose of using a neural network for text proofing is to provide validation of spelling and punctuation errors through an artificial intelligence (AI) model which is able to analyze and understand the entire context of every word in the original text rather than simply analyzing each individual word without analyzing the proper meaning of how the word will be used based on the entire context of the sentence using grammar principles. Errors that will be caught by AI-based punctuation applications fall into the following categories: - When to insert a comma before the word "and" within a sentence. - Insert commas into the appropriate location in a series of sentences (optional). - Inserting punctuation at the beginning of sentences.

Unlike traditional spell/check applications which perform only a cross reference lookup against dictionaries of words, the new AI-based models are able to detect and understand why we should place a comma before the word "and" in a sentence even though other sentences do not require such punctuation (e.g., using the word "but" at the beginning of one sentence means that it should not have a comma inserted before it). The most current versions of these models are currently being trained on billions of tokens within the Russian Language for all types of written material including books containing news articles to novels to business letters; thus allowing them to recognize and understand the nuanced usage of English punctuation that would not be taught in traditional classroom settings, such as author reputation, creative license vs. technical writing margin requirements, etc.
In Microsoft Word and Google Docs, a regular proofreader operates by utilizing a database of words and strict parameters of proofing for errors. If the word is correctly spelled, it is considered an error-free word, even though the word may not be appropriate within its context. For example, the word 'компания' as opposed to 'кампания'.
A neural network (i.e., a model that uses an artificial neural network – a mathematical model created using a computer program design) to proofread text functions differently than a traditional proofreader. A neural network will segment the text into small pieces (tokens), create a map of the relationship of each word's connection to one another, and use that mapping of connections to determine what word (or punctuation) should appear in that portion of the sentence.
Modern AI proofreading models use the Transformer architecture when processing text. Transformer sounds very complex; however, it is a tool that allows the model to analyze (view) the entire text at once instead of looking at only neighboring words. For example, suppose your proofreader checks a long sentence with several clauses using a traditional algorithm (predictive model). In that case, there is a risk of losing the flow of the sentence, as each clause must be proofed independently of the other clauses. A transformer, however, uses the map of the connection of the words from the beginning of the sentence to the end to check the flow of the sentence.
As a result of this, AI models recognize that the presence of commas before the word “which” in a sentence is determined by whether or not “which” has a restrictive clause (or not) following it. The use of Attention Mechanisms give the model the ability to evaluate each of the words' level of importance in a sentence, and when there are subject and predicate-dependent words in the same sentence, the placement of commas between subject and predicate is based on the relationship of those words. As a consequence, comma proofreading becomes significant in neural networks.
An essential point of confusion amongst people studying the Russian language is the differences between Russian and English punctuation. A model that is trained using an English translation of a given input will most likely fail when trying to predict commas for certain cases and agreements, as well as for placing commas in a number of complex sentence structures (compared to the way that this would be expected in English). This is due to the fact that the punctuation of Russian is generally less regularised than that of English language, thus allowing for greater variances and much more creative uses of punctuation.
As a result, high-quality punctuation neural networks will require training on only Russian language documents that are representative of the various genre of writing published in Russian. This includes collections of text from resources such as Wikipedia or news aggregation sites, but also from sources like books, magazines, and research papers.
For a neural network-based model to be successful at punctuation and error correction in Russian, it must be trained using data collected from Russian language professional editors of publications such as "Kommersant" to review the various types of text/case configuration (how commas are placed, how different types of text are structured, and how direct dialogue is formatted in fiction).
At ASCN.AI, we utilized both spelling error-checking and punctuation error-checking agents, which were built to produce SEO-based written content/email campaigns to our clients. Initially, the punctuation model produced strikingly odd comma placements e.g., a comma would be placed before "that" within a complex-output sentence (when identifying the comma placement rule) whereas a dash should have been used. Once we switched to utilizing a Russian language based punctuation model, the model's production accuracy increased from ~78% to ~94%. Additionally, 3 out of the 5 clients surveyed after the performance evaluation (of 5 surveyed clients) indicated that the documents produced with AI-produced punctuation resulted in document copies feeling far more "natural" than those document copies that had been edited manually within the production staff (for good measure) having followed formalised rules without taking into consideration the rhythm of the written work.
The manner in which a neural network checks for punctuation-related errors through the construction and analysis of syntactical structure, is to identify the main clause as well as any subordinate clauses, to determine if any participial clause or gerund phrases are boundaries within a sentence, as well as, to identify and flag any incorrect forms of introductory clauses within a given document. According to this proprietary analysis, the model has been trained to predict the correct position of punctuation, including comma, dash, colon, and all other forms of punctuation. The model also recognizes that when a (new) subject/predicate follows a conjunction "and," it will generally suggest adding a comma before the conjunction. If a "which" clause has a restrictive meaning, no comma will be suggested. Additionally, the model recognizes the word order of a sentence and signals that, if there are inversions in a sentence, the model may also suggest additional punctuation to help signal logical breaking points.
Error types detected by the model have included:
Of the languages that can be processed automatically, Russian is by far the most difficult. High levels of word order variability, free morphological systems, and the vast quantity of exceptions to rules (which is prevalent in both American English and Russian) are among the reasons for this high level of difficulty. In terms of training an artificial intelligence (AI), training AI on a universal corpus will lead to low accuracy rates as an AI model must be trained only on texts and data sets where Russian spelling and punctuation conventions are followed precisely to achieve an accurate model. Otherwise, it will produce very poor results overall.
The Russian-language internet space has developed its own distinct writing conventions that are generally accepted, such as having authorial punctuation conventions, intentionally breaking established conventions, using ellipses and exclamation points for dramatic effect. This presents a challenge for neural network models that correct errors in writing conventions: how do they determine whether an author has intentionally deviated from procedural conventions for effect or simply made a mistake?
Good writer correction models will balance rigid rules with flexibility when it comes to errors in writing conventions. They will not correct an author's use of creative punctuation when the deviation from conventional punctuation also serves to enhance the meaning of the written word. For example, if an author uses a dash instead of a period in a published opinion piece to create excitement and motion, their writer correction model will not change that dash to a period. If an author uses a comma before "which" in a formal business letter, their writer correction model will include that comma. Can you see the distinction?
The issue with this rationale is that in order to carry out this logic, the writer correction model needs to understand the genre of the text. Therefore, the most effective punctuation applications and websites allow users to select the genre of their writing (e.g., "business," "journalistic," "fiction," "informal") in order to permit the model to utilize a different degree of rigidity and/or flexibility for the selected genre.
Paronyms in language are words that are spelled differently but sound the same, have two meanings and therefore are different words. Some classic examples of Russian paronyms include: "одеть/надеть" [to put on/to put on] and "компания/кампания" [company/campaign] and "эффектный/эффективный" [spectacular/effective]. As both paronyms may be correctly spelled, any dictionary will not include either of these words as a "defined" entry. An example of how a contextual punctuation neural network understands whether it is appropriate to use “одеть” (to put on/dress) with “одеть ребёнка” (to dress a child) or “одеть пальто” (to dress a coat). Thus, through its understanding of the animate/inanimate nature of the noun being used, it determines the correct verb usage in relation to the noun.
For example, consider “company” or “campaign” being used in the context of an advertising strategy (“campaign”), compared to legal company entity (as in “company”). AI sees the specific sentiment based on the contextual considerations of the words adjacent to each other, and recognizes spelling and use of words in relation to their context, as compared to human readers, and corrects mistakes they may make when reading the text.
However, there are still challenges. Due to the nature of the Russian language and its difficult and complex grammatical constructs (cases, agreements, and free word order), different algorithms are faced with difficulties. The biggest challenge is determining context; there is no way to ensure that the word is used correctly without understanding the meaning of the sentence, which is why previous spelling correction software worked poorly with Russian text.
When we began deploying agents to create articles in Russian, initially the AI confused the words “spectacular” and “effective” in 15% of instances. After we fine-tuned the AI on a corpus of business documents, the agent was able to reliably differentiate between “spectacular performance” and “effective strategy” in the context of the words and use of the words in the context of the sentence.
For both investors and members of the finance sector, AI will make mistakes with specific terms (ticker symbols, smart contract names) so we recommend that you maintain your own whitelist of all permissible words that can occur in financial documents so that the neural network cannot inadvertently change the name of a project with a common word, in order to not suffer any surprises in the future.
Description: LanguageTool is a multilingual site that offers deep grammar, punctuation and style checking. It is an open source project which creates a trust for data security and has been in existence for many years as a reliable member.
Major features: Browser Extensions for both Chrome and Firefox. Google Docs-Microsoft word integration. Not only does the service check spelling, but has also checks for style flaws. It is trained on a massive collection of Russian-language textual data. The vendors claim there is no storage of user texts.
Monthly rates: from 4.99 Euros / month (Premium).
Payment from Russia: Payment struggles for those located in Russia (only foreign cards or PayPal is accepted).
Free version limits: 10,000 characters max per check.

Description: ReText.AI is a Russian service that focuses on punctuation and stylistic features of the linguistic aspects of the Russian Language, and is also well-known for its rewriting tools and as a domestic solution.
Major features: Deep punctuation analysis of complex structures of punctuation. Offering tips for improving the readability of your text. Both lays within the boundaries of an author's voice. Processing of local data is very significant when dealing with confidential texts. There are four types of checking modes that you can use, those being: Spelling, Punctuation, Grammar, etc.
Monthly fees: from $12/month (Premium).
Payment from Russia: Payment options for Russian users are ideal. You may use Russian cards, or use Cryptocurrencies.
Free version limits: 3,000 characters max per check.

Description: Grammarly is the most well-known A.I. text checking service, and is used by millions of consumers worldwide. Although it is a service that originally began as text checking for English only, they do offer users the capabilities of checking many other languages, as well.
Major features: The vendor checks the tone of your text. The Vendor is also suitable for basic checks only — unable to identify a lot of complex grammar rules.
Price: starts at $12/month (Premium).
Payment from Russia: impossible (Russian cards and PayPal are unsupported).

Description: Classic Russian spelling service with AI using contextual analysis and traditional spell checker with a modern engine.
Features include: Check grammar according to Russian language rules and consider grammar reforms. Check for archaism, rare words, and more. Check for technical vocabulary. MS office integration.
Price: free for online (business version also available) — contact for pricing.
Payment from Russia: possible (use Russian cards).
Description: Multi-functional Russian platform for calls to action. An all-in one content creator assistant. A replacement for many specialist SEO Tools. The swiss army knife of compounding.
Feature list: Modern spelling and punctuation checks (using language models); Tools for re-write/summarise/expand text; Ideation and image generation tools; Very user intuitive (like a word editor); Has an excellent understanding of Russian language context.
Price: starts at 490 RUB/month (up to 400/plan option available).
Payment option from Russia: possible (use Russian cards/SBP).
Free version limits: i.e., limited operations per month.

The chart below compares services related to grammar checking. The major attributes of the comparison chart are the primary purpose(s), Russian support, availability of free versions and payment options. The last column contains the lowest starting dollar price per month.
| Service | Primary Purpose | Russian Support | Free Version | Payment from RU | Price (from) |
|---|---|---|---|---|---|
| LanguageTool | Universal grammar and style check | Yes | Yes (10k chars) | No | 4.99 EUR/mo |
| ReText.AI | Deep punctuation check for Russian | Yes | Yes (3k chars) | Yes (Crypto) | $12/mo |
| Grammarly | Tone and readability check | Limited | Yes (basic) | No | $12/mo |
| ORFO Online | Spelling check by Russian rules | Yes | Yes (online) | Yes | Upon request |
| Camp | Comprehensive AI content assistant | Yes | Limited | Yes (SBP) | 490 RUB/mo |
The companies included in this briefing have full Russian support with the only exception being Grammarly. Although the two applications that provide 'deep' punctuation correction support, ReText and ORFO, are not full-service grammar checking programs they will find grammatical errors but lack support for grammar checking in the English language.
Using neural networks for checking spelling and punctuation can greatly enhance productivity and reduce technical errors. A neural check will allow an editor to proofread 10 times faster than a human would, but ultimately the approval of final copy with regard to style has to be made by a human. The main benefits of using AI rather than standard proofreading are:
When choosing the type of neural network to use for your project/task; consider that not all neural networks provide the same services or benefits, and therefore you need to consider your specific task, and project to select the best fit.
Students needing to produce an academic or scholarly work are typically looking for the following criteria: Exact adherence to guidelines; confirming accuracy of citations and sources; availability of anti-plagiarism functionality. In order to create academic writing, there must be no deviation (no errors) from the standard guidelines of academic writing. In addition to spell-checking and citations that comply with established bibliographic standards, the model for students needs to be capable of: formatting (using the proper quotation marks); and performing checks to ensure that the citations comply with standards; and properly comply with bibliographic standards. Academic writing is unique as it is predominantly scientific in nature and contains a variety of technical/scientific terminology; therefore there are specific AI products such as StudyAI which are specifically used for providing assistance to create academic writing; and ORFO Online is a good option as it conducts more in-depth analysis than an average service would typically provide.
Professional writers, copywriters, and editors - the criteria for usage would be: extensive style analysis, consistent synonym finding, and tone management assistance. If you're a professional writer of commercial texts, blogs or fiction, you'll likely find yourself needing a tool to improve the quality of your writing. Not only do you want to eliminate errors but you also want to enhance the readability of what you’ve written, suggest synonyms to use, analyze the length of your sentences and help you identify how formal your language is — all of this can be done with either Grammarly or ReText.AI. The AI in these applications can also analyze the tone of your writing. If it determines your writing sounds too formal, it may suggest alternative phrases that are softer in tone.
In addition to your writing, these types of tools can help automate your business processes through APIs and integrations. Some criteria to consider when evaluating tools include their availability of API/webhooks, as well as their capacity to be embedded throughout various platforms (such as CRM/Telegram). When speaking specifically about the ASCN.AI audience, it is vitally important that they know about APIs and using checkers to perform checks in agents; a subset of services, including ReText.AI and specialized APIs, allow for real-time and automated checks throughout the pipeline (i.e., through an integration into the automation process). Thereby, eliminating manual copying of text. Automating repetitive tasks is why we exist.
For checking quickly (i.e., in your daily email, and your daily text messaging, etc.), a service that includes both a browser extension and mobile app will be your best bet. Two such tools — LanguageTool and Grammarly — check your text while you're typing and will automatically highlight errors for you. You will not need to cut-and-paste any of your content into another window since AI checks your writing instantly while you type.
An assistant is a network designed to assist but not replace an editor. While AI can do a fantastic job of proofreading for typos, comma placement, and grammatical case, it is ultimately up to the human being to determine style, tone, and meaning. This is particularly true when the intention of the author and his/her unique intonation is extremely important in creative and journalistic forms of writing. A model may suggest a change that is better than the author's original thought; however, it will not necessarily provide a better version than what was originally intended.
Any service that has been trained on a large corpus of Russian text would be considered to be the most accurate. The two services that rank the highest for handling punctuation in Russian are LanguageTool and ReText.AI. Both of these tools look at a sentence's syntactic structure and consider the context of the sentence to help determine comma placement. When the text has sentences with complex grammatical structures that contain subordinate clauses, language and text have been handled better than by a standard checker.
Most neural networks provide a limited number of checks or basic functionality for free. Examples of services that offer free checks up to a certain volume or feature limit include LanguageTool, Grammarly, and ORFO Online. For general use, these services will generally provide what is needed. If the author wants to ensure that their work is edited for more complex style errors, or want the ability to integrate the editing tool with their work, a subscription is always needed.
Modern models based on the transformer architecture are better able to recognize the relationship between the independent and dependent portions of a sentence when determining the correct location of commas when the sentence contains subordinate, participial, or gerund phrases compared to older models. Modern models have a greater ability to determine the correct placement of commas in long, complex sentences than older models. However, it may be necessary to manually validate your text's punctuation if there are too many extremely complex sentence phrases in a single text.
This varies greatly depending on the individual neural networks you want. For example, LanguageTool does not keep any of its users' data, which can be important to users who have a need for confidentiality. Likewise, most Russian services like ORFO and ReText.AI take the necessary steps to comply with national laws regarding data storage. When handling confidential documents such as contracts and financial statements, it is advisable that the user uses local solutions or services with a guarantee that confidential documents will not be used to train their model.