I am currently working on a project to analyze Federal Open Market Committee minutes for the past 20 years in order to determine how the stock market will act or react to the FOMC decision to either increase or decrease interest rate.
I have converted all Fed Mins to Text documents as well as my preprocessing included the following operators Transform cases, tokenize, Filter Stop words, stem porter, filter tokens by length, and generate N-Grams).
My struggle to come up with a process to extract only “Interest Rate” phrase from each meeting minutes as well as “interest rate % percentage” ( Numeric)
For example, Meeting December 2017:
I have attached a sample of the Fed meeting mintues.
I really appreciate it your help in advance. Thanks!