NLTK (Natural Language Toolkit) is a Python library for processing human language data. It provides techniques such as tokenization, stemming, lemmatization, and parsing, and it supports tasks such as categorizing text, analyzing linguistic structure, and analyzing documents.
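As a rough illustration of the tokenization and stemming mentioned above, here is a minimal sketch. It assumes NLTK is installed and deliberately uses only components that need no extra data downloads (`wordpunct_tokenize` and `PorterStemmer`); the sample sentence and the helper name `stem_counts` are made up for this example. Lemmatization with `WordNetLemmatizer` works similarly but requires the WordNet corpus to be downloaded first, so it is left out here.

```python
from nltk.probability import FreqDist
from nltk.stem import PorterStemmer
from nltk.tokenize import wordpunct_tokenize


def stem_counts(text):
    """Tokenize text, stem each alphabetic token, and count the stems.

    Hypothetical helper for illustration; not part of NLTK.
    """
    # Split into runs of word characters and runs of punctuation.
    tokens = wordpunct_tokenize(text.lower())
    stemmer = PorterStemmer()
    # Keep alphabetic tokens only, dropping punctuation such as ".".
    stems = [stemmer.stem(tok) for tok in tokens if tok.isalpha()]
    return tokens, FreqDist(stems)


tokens, freq = stem_counts("The dog runs. The dogs are running.")
print(tokens)
print(freq.most_common(3))
```

Both "runs" and "running" reduce to the stem "run", and "dog" and "dogs" to "dog", so the frequency distribution counts each inflected pair as one item.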
A few of the NLTK classes, corpora, and helpers that we often use in NLP are:
- SequentialBackoffTagger
- DefaultTagger
- treebank
- wordnet
- FreqDist
- patterns (a user-defined list of regular expression and tag pairs passed to RegexpTagger)
- RegexpTagger
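- backoff_tagger (a helper function for chaining taggers as backoffs, typically user-defined rather than imported from NLTK)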
- UnigramTagger, BigramTagger, and TrigramTagger
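Several of the taggers listed above are usually combined into a backoff chain, where each tagger hands a word to the next one when it cannot tag it. The following is a minimal sketch of that idea, not a production setup: the two-sentence training set and the regular expression patterns are invented for illustration. In practice the training sentences would more likely come from a tagged corpus such as `treebank`, which has to be downloaded separately.

```python
from nltk.tag import BigramTagger, DefaultTagger, RegexpTagger, UnigramTagger

# Toy training data (an assumption for this sketch): lists of (word, tag) pairs.
train_sents = [
    [("the", "DT"), ("dog", "NN"), ("barks", "VBZ")],
    [("the", "DT"), ("cat", "NN"), ("sleeps", "VBZ")],
]

# Illustrative (regex, tag) pairs; the first matching pattern wins.
patterns = [
    (r"^\d+$", "CD"),    # whole numbers
    (r".*ing$", "VBG"),  # gerunds
    (r".*ed$", "VBD"),   # simple past
]

# Build the chain from the most general tagger to the most specific one.
default = DefaultTagger("NN")                         # last resort: tag as noun
regexp = RegexpTagger(patterns, backoff=default)      # pattern-based guesses
unigram = UnigramTagger(train_sents, backoff=regexp)  # per-word statistics
bigram = BigramTagger(train_sents, backoff=unigram)   # word plus previous tag

tagged = bigram.tag(["the", "dog", "jumped", "42", "times"])
print(tagged)
# [('the', 'DT'), ('dog', 'NN'), ('jumped', 'VBD'), ('42', 'CD'), ('times', 'NN')]
```

Here "the" is tagged from the training data, "jumped" and "42" fall through to the regular expression patterns, and "dog" and "times" end up with the default tag.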