I am using nltk's Tree data structure.Below is the sample nltk.Tree.
(S
(S
(ADVP (RB recently))
(NP (NN someone))
(VP
(VBD mentioned)
(NP (DT the) (NN word) (NN malaria))
(PP (TO to) (NP (PRP me)))))
(, ,)
(CC and)
(IN so)
(S
(NP
(NP (CD one) (JJ whole) (NN flood))
(PP (IN of) (NP (NNS memories))))
(VP (VBD came) (S (VP (VBG pouring) (ADVP (RB back))))))
(. .))
I am not aware of nltk.Tree datastructure. I want to extract the parent and the super parent node for every leaf node e.g. for 'recently' I want (ADVP, RB), and for 'someone' it is (NP, NN)This is the final outcome i want.Earlier answer used eval() function to do so which i want to avoid.
[('ADVP', 'RB'), ('NP', 'NN'), ('VP', 'VBD'), ('NP', 'DT'), ('NP', 'NN'), ('NP', 'NN'), ('PP', 'TO'), ('NP', 'PRP'), ('S', 'CC'), ('S', 'IN'), ('NP', 'CD'), ('NP', 'JJ'), ('NP', 'NN'), ('PP', 'IN'), ('NP', 'NNS'), ('VP', 'VBD'), ('VP', 'VBG'), ('ADVP', 'RB')]
See Question&Answers more detail:
os