TWAIN-L Archives

Mark Twain Forum


Options: Use Forum View

Use Monospaced Font
Show Text Part by Default
Condense Mail Headers

Message: [<< First] [< Prev] [Next >] [Last >>]
Topic: [<< First] [< Prev] [Next >] [Last >>]
Author: [<< First] [< Prev] [Next >] [Last >>]

Print Reply
Mark Twain Forum <[log in to unmask]>
Wed, 24 Jun 2020 12:11:42 -0400
Mark Twain Forum <[log in to unmask]>
Peter Salwen <[log in to unmask]>
text/plain; charset="UTF-8"
To: Clay Shannon <[log in to unmask]>
text/plain (37 lines)
I'd like to have a look. Does your program suggest that *Tom Sawyer* and
*Hucklebbery Finn* were written by different men? That would be intriguing,
and in some ways it would make sense. The Samuel Clemens of the mid-1870s
had changed considerably by the mid-1880s. He'd spent a lot of time being
annoyed and possibly influenced by Cable, among other things -- not to
mention his abolitionist in-laws. And of course, *Tom Sawyer* was written
by Mr. Mark Twain, and *Huckleberry Finn* was written by Huckleberry Finn.
In any case I'd love to see your PDF. Does it come with an executive

*Peter Salwen /*
*114 W 86, NYC 10024 | 917-620-5371*

On Tue, Jun 23, 2020 at 9:02 PM Clay Shannon <[log in to unmask]> wrote:

> Twainians and Twainiacs, lend me your peepers!
> I have written a computer program which compares two documents to
> determine the likelihood of them having the same author. Among many other
> pairs of writings (MLK, Malcolm X, Ted Kaczynski, etc.), I compared "The
> Adventures of Tom Sawyer" with "Adventures of Huckleberry Finn."
> The PDF report that the program generates gives statistics about average
> length of sentences and words in both books it is analyzing, frequency of
> usage of various symbols (from commas to @), phrases (of ten letters or
> more) that both documents have in common ("Tom" and "Huck" have 4900 such
> phrases in common!), and uncommon* words used in both books, with counts
> and percentages.
> I would be happy to email anybody here that is interested the PDF report
> that my app generated. Lector emptor: it is 385 pages long.
> * In this case, the definition of "uncommmon" is any word other than the
> 3,000 most-used English words.
> - B. Clay Shannon