TWAIN-L Archives

Mark Twain Forum

TWAIN-L@YORKU.CA

Options: Use Forum View

Use Monospaced Font
Show Text Part by Default
Condense Mail Headers

Message: [<< First] [< Prev] [Next >] [Last >>]
Topic: [<< First] [< Prev] [Next >] [Last >>]
Author: [<< First] [< Prev] [Next >] [Last >>]

Print Reply
Sender:
Mark Twain Forum <[log in to unmask]>
Date:
Wed, 24 Jun 2020 12:11:42 -0400
Reply-To:
Mark Twain Forum <[log in to unmask]>
Subject:
From:
Peter Salwen <[log in to unmask]>
Content-Type:
text/plain; charset="UTF-8"
In-Reply-To:
MIME-Version:
1.0
Comments:
To: Clay Shannon <[log in to unmask]>
Parts/Attachments:
text/plain (37 lines)
I'd like to have a look. Does your program suggest that *Tom Sawyer* and
*Hucklebbery Finn* were written by different men? That would be intriguing,
and in some ways it would make sense. The Samuel Clemens of the mid-1870s
had changed considerably by the mid-1880s. He'd spent a lot of time being
annoyed and possibly influenced by Cable, among other things -- not to
mention his abolitionist in-laws. And of course, *Tom Sawyer* was written
by Mr. Mark Twain, and *Huckleberry Finn* was written by Huckleberry Finn.
In any case I'd love to see your PDF. Does it come with an executive
summary?
*_________________________________*

*Peter Salwen /* salwen.com
*114 W 86, NYC 10024 | 917-620-5371*



On Tue, Jun 23, 2020 at 9:02 PM Clay Shannon <[log in to unmask]> wrote:

> Twainians and Twainiacs, lend me your peepers!
> I have written a computer program which compares two documents to
> determine the likelihood of them having the same author. Among many other
> pairs of writings (MLK, Malcolm X, Ted Kaczynski, etc.), I compared "The
> Adventures of Tom Sawyer" with "Adventures of Huckleberry Finn."
> The PDF report that the program generates gives statistics about average
> length of sentences and words in both books it is analyzing, frequency of
> usage of various symbols (from commas to @), phrases (of ten letters or
> more) that both documents have in common ("Tom" and "Huck" have 4900 such
> phrases in common!), and uncommon* words used in both books, with counts
> and percentages.
> I would be happy to email anybody here that is interested the PDF report
> that my app generated. Lector emptor: it is 385 pages long.
> * In this case, the definition of "uncommmon" is any word other than the
> 3,000 most-used English words.
>
> - B. Clay Shannon
>

ATOM RSS1 RSS2