Index of /~billw/cs9414/notes/corpora
      Name                     Last modified      Size  Description

[DIR] Parent Directory         17-Apr-2012 13:51     -
[   ] austen_tagged            13-May-2008 09:54   88k
[   ] austen_tagged.out        16-Jun-2005 09:08   27k
[   ] austen_tagged_clean      16-Jun-2005 09:08   87k
[TXT] austen_tags.html         18-May-2005 11:03    3k
[   ] ccc                      12-May-2005 15:14    1k
[   ] copy                     12-May-2005 10:19    6k
[   ] dubble                   12-May-2005 09:23    1k
[   ] henry-eliza-original     10-May-2005 12:14   11k
[TXT] henry-eliza-original..>  11-May-2005 13:37   11k
[   ] henry_and_eliza_orig     13-May-2008 09:56   10k
[   ] henry_eliza_tagged       13-May-2005 08:02   23k
[TXT] henry_eliza_tags.html    13-May-2005 08:02    2k
[   ] henry_eliza_words        10-May-2005 14:53    6k
[   ] jack_alice_original      12-May-2005 15:34   30k
[   ] jack_alice_tagged        16-May-2005 18:41   65k
[   ] lexigen                  16-May-2005 18:43   29k
[   ] oldstats.pl              13-May-2005 19:41    1k
[   ] paircounts               12-May-2005 15:10    7k
[   ] pos                      16-May-2005 19:27    1k
[   ] roughstats.pl            12-May-2005 14:55    1k
[   ] spaces                   13-May-2005 12:07   64k
[   ] stats.pl                 13-May-2005 19:43    1k
[   ] stats_all                16-May-2005 19:26   27k
[   ] statsout                 12-May-2005 15:30   16k
[   ] tags                     12-May-2005 10:21    1k

henry_eliza_tagged

This file contains a roughly part-of-speech-tagged text of
a short story by Jane Austen titled "Henry and Eliza".
The text is taken from http://www.pemberley.com/janeinfo/henreliz.html
(10 May 2005), with some minor modifications: freind, freinds and
freindship were converted to friend, friends and friendship
throughout, some punctuation was changed (Mrs. -> Mrs), and the
non-ASCII character representing the pound sign was replaced by the
word "pound". These changes were made for various reasons, principally
to make the text a bit easier to process by computer, e.g. to avoid
having to insert "freind" into the lexicon, cope with unknown
words, or do automated spelling correction.
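
The modifications described above could be sketched roughly as follows.
This is an illustration only, not the procedure actually used to prepare
the file: the function name normalise is hypothetical, and both the
handling of capitalised forms (Freind -> Friend) and the exact spacing
around the inserted word "pound" are assumptions.

```python
import re

# Archaic spellings named in the notes, mapped to their modern forms.
SPELLING = {
    "freind": "friend",
    "freinds": "friends",
    "freindship": "friendship",
}


def normalise(text: str) -> str:
    """Apply the three kinds of modification described in the notes."""

    def fix_spelling(match: re.Match) -> str:
        word = match.group(0)
        fixed = SPELLING[word.lower()]
        # Assumption: keep an initial capital if the original had one.
        return fixed.capitalize() if word[0].isupper() else fixed

    # 1. freind, freinds, freindship -> friend, friends, friendship
    text = re.sub(r"\b(freindship|freinds|freind)\b", fix_spelling,
                  text, flags=re.IGNORECASE)

    # 2. Punctuation change: Mrs. -> Mrs
    text = re.sub(r"\bMrs\.", "Mrs", text)

    # 3. Pound sign -> the word "pound" (assumption: followed by a space)
    text = text.replace("£", "pound ")

    return text


if __name__ == "__main__":
    print(normalise("Mrs. Jones gave her freind £50 for her Freindship."))
```

Matching whole words keeps the change limited to the three forms listed
in the notes, so the lexicon needs entries only for the modern spellings.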

The original before tagging (1936 words) is in the file
henry-eliza-original.

The text appears to be part of Austen's juvenilia.