A comparison between a passage from the Italian translation of Alice's Adventures in Wonderland and the original English text. The statistical analysis (with the textAnalysis command) shows that vowels, consonants, and punctuation marks have roughly the same distribution in the two texts. The letters A, I, O, and L are much more frequent in the Italian version; H, K, T, W, and Y are much more frequent in the English version. The distributions of word length are very different: the histograms have different shapes, and in Italian the mean and the median are both close to 5 (4.76 and 5), while in English the median (3) is well below the mean, which is about 4. In particular, words of 3 and 4 letters are much more frequent in the English version, and words of 5, 6, 7, and 8 letters in the Italian one. White spaces make up about 16% of the characters in Italian and about 19% in English. Other passages in the two languages (including technical and scientific texts) show similar trends. The program uses double quotation marks to delimit the texts. To analyze a text that contains double quotation marks ("), you need to replace them with single quotation marks (').
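The quotation-mark replacement described above can be done in R itself when the text comes from elsewhere (for example a file read with readLines). The following is only a minimal sketch in base R: the helper name cleanQuotes is hypothetical and is not part of textanalysis.R.

```r
# Hypothetical helper (not part of textanalysis.R): replace every straight
# double quotation mark with a single one before the text is analyzed.
cleanQuotes <- function(x) {
  gsub('"', "'", x, fixed = TRUE)
}

# The sample is written between single quotes so that it can contain
# double quotation marks.
raw <- 'She said "Oh dear!" and ran.'
cleanQuotes(raw)
# [1] "She said 'Oh dear!' and ran."
```

The cleaned string can then be assigned to a variable and passed to textAnalysis in the same way as the texts below.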

text1 = "ALICE cominciava a sentirsi mortalmente stanca di sedere sul poggio, accanto a sua sorella, senza far nulla: una o due volte aveva gittato lo sguardo sul libro che leggeva sua sorella, ma non c’erano imagini nè dialoghi, 'e a che serve un libro,' pensò Alice, 'senza imagini e dialoghi?'.
E andava fantasticando col suo cervello (come meglio poteva, perchè lo stellone l’avea resa sonnacchiosa e grullina), se il piacere di fare una ghirlanda di margherite valesse la noja di levarsi su, e cogliere i fiori, quand’ecco un Coniglio bianco con gli occhi di rubino le passò da vicino.
Davvero non c’era troppo da meravigliarsi di ciò, nè Alice pensò che fosse cosa troppo stravagante di sentire parlare il Coniglio, il quale diceva fra sè 'Oimè! Oimèi! ho fatto tardi!' (quando se lo rammentò in seguito s’accorse che avrebbe dovuto meravigliarsene, ma allora le sembrò una cosa assai naturale): ma quando il Coniglio trasse un oriuolo dal taschino del panciotto, e vi affisò gli occhi, e scappò via, Alice saltò in piedi, perchè l’era venuto in mente ch’ella non avea mai veduto un Coniglio col panciotto e il suo rispettivo taschino, nè con un oriuolo da starvici dentro, e divorata dalla curiosità, traversò il campo correndogli appresso, e giunse proprio a tempo di vederlo slanciarsi in una spaziosa conigliera, di sotto alla siepe."

text2 = "Alice was beginning to get very tired of sitting by her sister on the bank, and of having nothing to do: once or twice she had peeped into the book her sister was reading, but it had no pictures or conversations in it, 'and what is the use of a book,' thought Alice 'without pictures or conversation?'
So she was considering in her own mind (as well as she could, for the hot day made her feel very sleepy and stupid), whether the pleasure of making a daisy-chain would be worth the trouble of getting up and picking the daisies, when suddenly a White Rabbit with pink eyes ran close by her.
There was nothing so very remarkable in that; nor did Alice think it so very much out of the way to hear the Rabbit say to itself, 'Oh dear! Oh dear! I shall be late!' (when she thought it over afterwards, it occurred to her that she ought to have wondered at this, but at the time it all seemed quite natural); but when the Rabbit actually took a watch out of its waistcoat-pocket, and looked at it, and then hurried on, Alice started to her feet, for it flashed across her mind that she had never before seen a rabbit with either a waistcoat-pocket, or a watch to take out of it, and burning with curiosity, she ran across the field after it, and fortunately was just in time to see it pop down a large rabbit-hole under the hedge."

textAnalysis(text1)

1336   characters 
Vowels,  consonants,  white spaces,  marks (.?!),  more: 
yellow,cyan,... % 248 817 221 7 43 
yellow,cyan,... % 18.56287 61.15269 16.54192 0.5239521 3.218563 
 
[these are the outputs in Windows if, before using textAnalysis, you load:
 source("http://macosa.dima.unige.it/R/textanalysis.R")
 Otherwise you get the following outputs, where accented letters are
 classified not as "vowels" or "consonants" but as "more":
  Vowels,  consonants,  white spaces,  marks (.?!),  more: 
  yellow,cyan,... % 230 816 221 7 62
  yellow,cyan,... % 17.21557 61.07784 16.54192 0.5239521 4.640719          ]
 
Words classified by length 
[the distance of the sides of the grid is   5 %] 
  Frequencies and percentage freq.: 
20, 46, 31, 8, 34, 28, 22, 20, 5, 6, 3, 1, 2, 0, 1
8.81,20.26,13.66,3.52,14.98,12.33,9.69,8.81,2.2,2.64,1.32,0.44,0.88,0,0.44
  For other statistics use the command   morestat()
   Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
  1.000   2.000   5.000   4.758   7.000  15.000 
    The brown dots are 5^ and 95^ percentiles 
           The red dot is the mean 

textAnalysis(text2)

1327   characters 
Vowels,  consonants,  white spaces,  marks (.?!),  more: 
yellow,cyan,... % 226 806 252 6 37 
yellow,cyan,... % 17.0309 60.73851 18.9902 0.4521477 2.788244 
Words classified by length 
[the distance of the sides of the grid is   5 %] 
  Frequencies and percentage freq.: 
12, 56, 67, 44, 24, 18, 16, 8, 2, 2, 4, 1, 1, 0, 0, 2
4.67,21.79,26.07,17.12,9.34,7,6.23,3.11,0.78,0.78,1.56,0.39,0.39,0,0,0.78
  For other statistics use the command   morestat() 
   Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
  1.000   2.000   3.000   4.062   5.000  16.000 
    The brown dots are 5^ and 95^ percentiles 
           The red dot is the mean 
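As a rough cross-check of the word-length statistics, a similar summary can be sketched in base R. This is not how textAnalysis works internally: the helper wordLengths is hypothetical, and it assumes that a word is any maximal run of letters, so apostrophes and hyphens split words and the numbers need not match the outputs above. How accented letters are treated by [[:alpha:]] also depends on the locale.

```r
# Hypothetical helper (not part of textanalysis.R): word lengths of a text.
# Assumption: a "word" is a maximal run of letters.
wordLengths <- function(x) {
  words <- unlist(strsplit(x, "[^[:alpha:]]+"))
  words <- words[nzchar(words)]   # drop empty pieces (text starting with a non-letter)
  nchar(words)
}

sampleText <- "Alice was beginning to get very tired"
len <- wordLengths(sampleText)
table(len)                               # frequencies by word length
round(100 * prop.table(table(len)), 2)   # percentage frequencies
summary(len)                             # min, quartiles, median, mean, max
```

Applying wordLengths to text1 and text2 gives a quick way to compare medians and means, keeping in mind that the tokenization is an assumption.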

First the graphical output for the Italian text, then for the English one:





If I want to draw the histograms on the same scale, I can set the height (percentage) and the window width (number of classes) before calling textAnalysis:
maxIsto=27; maxIstoN=16; textAnalysis(text1)
maxIsto=27; maxIstoN=16; textAnalysis(text2)
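An alternative way to compare the two distributions is to draw them side by side in a single chart. The sketch below uses only base R (barplot) and the percentage frequencies copied from the two outputs above; the variable names and axis labels are illustrative and are not part of textanalysis.R.

```r
# Percentage frequencies by word length, copied from the outputs above
it <- c(8.81, 20.26, 13.66, 3.52, 14.98, 12.33, 9.69, 8.81,
        2.2, 2.64, 1.32, 0.44, 0.88, 0, 0.44)
en <- c(4.67, 21.79, 26.07, 17.12, 9.34, 7, 6.23, 3.11,
        0.78, 0.78, 1.56, 0.39, 0.39, 0, 0, 0.78)

n  <- max(length(it), length(en))      # 16 classes, as in maxIstoN
it <- c(it, rep(0, n - length(it)))    # pad the shorter vector with zeros

both <- rbind(Italian = it, English = en)
colnames(both) <- 1:n

# Same vertical scale as maxIsto = 27
barplot(both, beside = TRUE, ylim = c(0, 27), legend.text = TRUE,
        xlab = "word length (letters)", ylab = "% of words")
```

With the bars side by side, the excess of 3- and 4-letter words in English and of 5- to 8-letter words in Italian is visible at a glance.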

Some differences also emerge between different authors writing in the same language, linked to each author's style. Let's examine the differences between the Italian Alice passage and the following fable by Gianni Rodari. In the fable the number of full stops increases (the marks (.?!) go from about 0.5% to about 1.8% of the characters), words of intermediate length become less frequent, and the median (4) drops below the mean (4.59). White spaces are still about 16%.

text3 = "Giovannino Perdigiorno, gran viaggiatore e famoso esploratore, capitò una volta nel paese degli uomini di burro. A stare al sole si squagliavano, dovevano vivere sempre al fresco, e abitavano in una città dove al posto delle case c'erano tanti frigonferi Giovannino passava per le strade e li vedeva affacciati ai finestrini dei loro frigoriferi, con una borsa di ghiaccio in testa. Sullo sportello di ogni frigorifero c'era un telefono per parlare con l'inquilino.
- Pronto.
- Pronto.
- Chi parla?
- Sono il re degli uomini di burro. Tutta panna di prima qualità. Latte di mucca svizzera. Ha guardato bene il mio frigorifero?
- Perbacco, è d'oro massiccio. Ma non esce mai di li?
- D'inverno, se fa abbastanza freddo, in un'automobile di ghiaccio.
- E se per caso il sole sbuca d'improvviso dalle nuvole mentre la Vostra Maestà fa la sua passeggiatina?
- Non può, non è permesso. Lo farei mettere in prigione dai miei soldati.
- Bum, - disse Giovannino. E se ne andò in un altro paese."

textAnalysis(text3)

990   characters 
Vowels,  consonants,  white spaces,  marks (.?!),  more: 
yellow,cyan,... % 180 592 164 18 36 
yellow,cyan,... % 18.18182 59.79798 16.56566 1.818182 3.636364 
   Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
  1.000   2.000   4.000   4.586   6.000  13.000
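The percentage lines in the outputs are the counts divided by the total number of characters. As a small worked check, the sketch below recomputes them in base R for the three texts from the counts printed above; the matrix and the column names are illustrative labels and are not produced by textAnalysis.

```r
# Character counts copied from the three outputs above
counts <- rbind(
  text1 = c(248, 817, 221,  7, 43),   # Italian Alice, 1336 characters
  text2 = c(226, 806, 252,  6, 37),   # English Alice, 1327 characters
  text3 = c(180, 592, 164, 18, 36))   # Rodari fable,   990 characters
colnames(counts) <- c("vowels", "consonants", "spaces", "marks", "more")

# Row-wise percentages, rounded to two decimals
pct <- round(100 * prop.table(counts, margin = 1), 2)
pct[, c("spaces", "marks")]
```

The two columns shown put the share of white spaces and of marks (.?!) for the three texts next to each other, which is the comparison made in the text.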