Other forms of the Enron data
This page contains processed versions of the Enron dataset that may be more directly useful than the raw data.
Nouns were extracted using the Monty tagger, which tends to treat words it does not recognise as nouns. There are about a thousand strange strings at the beginning of the list.
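If the strange strings get in the way, one option is to filter the noun list before using it. The sketch below is only an illustration, not part of the original data preparation: the function name `filter_nouns`, the pattern `PLAUSIBLE_NOUN`, and the rule that a plausible noun is alphabetic (with optional internal hyphens or apostrophes) and at least two characters long are all assumptions, and the sample entries are made up rather than taken from the actual list.

```python
import re

# Assumption: a plausible noun consists of letters only, optionally joined by
# single internal hyphens or apostrophes (e.g. "e-mail"). This rule is a guess
# at what separates real words from tagger noise; adjust it to suit the data.
PLAUSIBLE_NOUN = re.compile(r"^[A-Za-z]+(?:['-][A-Za-z]+)*$")


def filter_nouns(nouns):
    """Return the entries that look like ordinary words, in original order."""
    return [n for n in nouns if len(n) >= 2 and PLAUSIBLE_NOUN.match(n)]


# Made-up sample entries, for illustration only.
sample = ["$$$", "0001", "--x", "contract", "pipeline", "e-mail"]
print(filter_nouns(sample))
```

A filter like this will also discard legitimate entries that contain digits or other symbols, so it is worth checking a sample of what is removed before relying on it.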
Thanks to Nikhil Vats for doing the data preparation. Please report errors and corrections to David Skillicorn.