Bill Wilson's Natural Language Processing Research Page

Bill Wilson
School of Computer Science and Engineering
University of New South Wales

E-mail: billw at cse.unsw.edu.au


Well, actually lately I've been side-tracked working on recurrent neural network architectures. This work originally arose from a problem in computational linguistics.

This problem was an attempt to capture the graphotactic patterns of English words: e.g. that a word can end but not begin with "nd". It developed into a comparison of different recurrent network archictectures for studying problems of this type. The graphotactic project in turn developed out of a project that used letter trigrams to try to distinguish, among novel words in text, the ones that were typographical or similar errors, and those that were real words not covered by one's lexicon. (Obviously, some typographical errors result in words with graphotactically legal structure (like the mis-spelling of "architecture" that I just noticed, a couple of sentences back), and some real unknown words would be borrowings from languages with different graphotactic structures, but the idea was to get at least some idea of whether the novel word was OK.

Two of my students, however, are carrying on the NLU research here at UNSW:


Here's a collection of ill-formed sentences (as used by Min, but collected by me).
Two other people in the School of Computer Science and Engineering do NLP-relevant work:
Microsoft Research Institute has a NLP Resources page on NLP people/places/... in Australia.
Just for fun (supposedly a real headline somewhere, somewhen) ...

Bill Wilson's contact info

Last updated: