CPC G06F 40/44 (2020.01) [G06F 40/289 (2020.01); G06F 40/56 (2020.01)] | 12 Claims |
1. A computer-implemented method of abbreviating words of a body of text written in a spoken language, input by a user, comprising the steps of:
determining a statistical frequency of all word forms in a predetermined set of texts;
selecting word forms having a statistical frequency above a predetermined level to receive abbreviations;
selecting and assigning, hierarchically, the shortest available abbreviations to the most frequent word forms, by:
assigning, for the first most frequent word form, a shortest formulated abbreviation for that word form, which abbreviation is not itself a word in the spoken language; and
assigning, in descending order of most frequent word forms, the shortest formulated abbreviation for that word form, which abbreviation is not itself a word in the spoken language, and which has not been previously assigned to a statistically more frequent word form;
wherein the shortest formulated abbreviation of a word form is selected by performing the following steps, in order, until an abbreviation is available, as follows:
formulating a one-letter abbreviation by truncation;
formulating an abbreviation consisting of a first letter and a last letter of an invariable part of the word form, plus full inflectional or conjugational prefix and suffix, if applicable;
formulating a two-letter abbreviation by truncation;
formulating an abbreviation consisting of the first letter of the invariable part of the word form plus a first consonant after the first letter, plus the last letter of the invariable part of the word form plus full inflectional or conjugational prefix and suffix, if applicable;
formulating a three-letter abbreviation by truncation;
formulating an abbreviation consisting of the first letter of the invariable part of the word form plus the first consonant and a second consonant after the first letter plus the last letter of the invariable part of the word form, plus full inflectional or conjugational prefix and suffix, if applicable; and
formulating a four-letter abbreviation by truncation;
providing a word processing program for typing input by a user; and
applying the abbreviations to the body of text,
wherein when a user types a series of keystrokes corresponding to an abbreviation assigned to a word form, the typed abbreviation is automatically expanded into the assigned word form.
|
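The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation, only one reading of the claim under stated assumptions: words are lowercase ASCII; the "invariable part" and any inflectional prefix or suffix come from a hypothetical `segment` callable (by default the whole word form is treated as invariable); the consonants used in the contracted forms are taken from between the first and last letters of the invariable part; and a candidate is accepted only if it is shorter than the word form, a condition the claim does not state. All function and parameter names (`word_frequencies`, `candidates`, `assign`, `expand`, `threshold`, `lexicon`) are illustrative.

```python
import re
from collections import Counter

VOWELS = set("aeiou")  # assumption: English-like vowel set


def word_frequencies(texts):
    """Count every word form in a predetermined set of texts."""
    counts = Counter()
    for text in texts:
        counts.update(re.findall(r"[a-z]+", text.lower()))
    return counts


def candidates(word, stem=None, prefix="", suffix=""):
    """Yield candidate abbreviations in the order recited in the claim.

    Truncations of 1, 2 and 3 letters alternate with contractions built
    from the invariable part (stem) plus the full affixes; a 4-letter
    truncation comes last.
    """
    stem = stem or word
    # Assumption: consonants strictly between the first and last letters.
    inner = [c for c in stem[1:-1] if c not in VOWELS]
    for n in (1, 2, 3):
        yield word[:n]  # n-letter truncation
        # Contraction: first letter, n-1 consonants, last letter, affixes.
        if len(stem) >= 2 and len(inner) >= n - 1:
            yield prefix + stem[0] + "".join(inner[:n - 1]) + stem[-1] + suffix
    yield word[:4]  # four-letter truncation


def assign(counts, lexicon, threshold, segment=lambda w: ("", w, "")):
    """Map abbreviations to word forms, most frequent word form first."""
    table = {}
    for word, n in counts.most_common():
        if n <= threshold:  # only forms above the predetermined level
            break
        prefix, stem, suffix = segment(word)
        for cand in candidates(word, stem, prefix, suffix):
            # Reject real words and abbreviations already taken by a more
            # frequent form; the length check is this sketch's assumption.
            if len(cand) < len(word) and cand not in lexicon and cand not in table:
                table[cand] = word
                break
    return table


def expand(typed, table):
    """Replace each typed abbreviation with its assigned word form."""
    return re.sub(r"[a-z]+", lambda m: table.get(m.group(), m.group()), typed)


counts = word_frequencies(["the cat and the dog", "the cat sat"])
table = assign(counts, lexicon=set(counts), threshold=1)
print(expand("t c sat", table))
```

With this toy corpus, only "the" (3 occurrences) and "cat" (2 occurrences) exceed the threshold, so each receives its one-letter truncation and the final line prints "the cat sat". A real word processor would perform the expansion on keystroke events rather than on a finished string, and the lexicon would be a full dictionary of the language rather than the corpus vocabulary.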