Probabilistic Context Free Grammars

Natural language processing (nlp for short) is the processing mechanism by which we teach computers to understand sentences and make some type of coherent sense of them. Yesterday, I read a lab paper from Stanford on Probabilistic Context Free Grammars (PCFGs) and thought I’d share my revelations from that piece of work with you.

  1. PCFGs are made up from a set of terminal and non-terminal objects
  2. PCFGs can be decomposed by following a set of rules pre described to exist for the set described in step 1.
  3. PCFGs are only valid symbol sets if a sentence s can be composed or decomposed from the given set of rules, and some tree can be formed from the given PCFG.
  4. The main point of the lab article was to prove that unlexicalized PCFGs are still competitive ways of analyzing sentences and that simply increasing the resolution of the parse tree can give you deeper in depth knowledge on the sentence than previously thought. Lexicalized PCFGs are not always needed, and may take more computational complexity, and space.

Thats all for now. More to come on NLP articles in the future. Today’s research goal is to research the customer service and retail industries. I’ll post my findings here hopefully by later tonight.

If you want to reference the two articles: pcfg.pdf, accurate unlexicalized parsing.


Leave a Reply

Fill in your details below or click an icon to log in: Logo

You are commenting using your account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s