Phrases and Entities

26 - Dependency Nounchunks

Breaking text into verb- or noun-phrases result into semantically correct subphrases of a sentence that are deducted from the dependency structure.

A form of Constituency Parsing is breaking a text into sub-phrases. Partial parsing is known as chunking and has noun-phrases (nounchunks), verb-phrases, adjective phrases or prepositional phrases as a result. You can compare this with the more default N-grams.

You can test spaCy for its nounchunker. The chunks are based on dependency tags.

Nounchunks from English spaCy model. It would be better to have ‘the summer of 1969’ as chunk (source)



This article is part of the project Periodic Table of NLP Tasks. Click to read more about the making of the Periodic Table and the project to systemize NLP tasks.