An NLP technique for extracting structured information from unstructured text.

Steps:
1. Document Parsing
2. Tokenizing
3. Stop word removal
4. Stemming
5. Phrases and N-grams
6. Document Structure and Markup
7. Named Entity Recognition (NER)
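
A few of these steps (tokenizing, stop word removal, stemming, and N-grams) can be sketched with the Python standard library alone. This is a minimal illustration, not a production pipeline: the stop-word list, the suffix-stripping rule, and all function names are assumptions made for this example. A real system would more likely use an established stemmer (such as Porter's) and a trained model for NER. Document parsing, markup handling, and NER are not covered here.

```python
import re

# Tiny illustrative stop-word list (an assumption; real lists are much longer).
STOP_WORDS = {"a", "an", "the", "is", "are", "of", "in", "on", "and", "to"}


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def remove_stop_words(tokens):
    """Drop tokens that appear in the stop-word list."""
    return [t for t in tokens if t not in STOP_WORDS]


def naive_stem(token):
    """Strip one common suffix; a crude stand-in for a real stemmer."""
    for suffix in ("ing", "ed", "es", "s"):
        # Keep at least three characters so short words are left alone.
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def ngrams(tokens, n):
    """Return every run of n consecutive tokens as a tuple."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def preprocess(text):
    """Tokenize, remove stop words, then stem each remaining token."""
    return [naive_stem(t) for t in remove_stop_words(tokenize(text))]


terms = preprocess("The cats are chasing the mice in the garden")
print(terms)             # ['cat', 'chas', 'mice', 'garden']
print(ngrams(terms, 2))  # [('cat', 'chas'), ('chas', 'mice'), ('mice', 'garden')]
```

The stem "chas" shows why suffix stripping is only an approximation: stems need not be dictionary words, they only need to map related word forms to the same string.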