inriaxmlwrapper

Intro

Provides an interface to the INRIA XML database released by Prof. Gerard Huet (https://gitlab.inria.fr/huet/Heritage_Resources). Thanks to Prof. Huet for making the data available!

Inspired by https://github.com/drdhaval2785/inriaxmlwrapper (Thanks @drdhaval2785!)

@author: Avinash Varna (@avinashvarna)

Usage

The InriaXMLWrapper utility class can be used to look up the lexical tags of a word:

>>> from sanskrit_parser.util.inriaxmlwrapper import InriaXMLWrapper
>>> db = InriaXMLWrapper()
>>> db_tags = db.get_tags('hares')
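>>> # `tags` is not defined in this snippet; it is assumed to hold the result of an earlier lookup of 'hares'
>>> tags == db_tags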
True

Command line usage

$ python -m sanskrit_parser.util.inriaxmlwrapper hares
INFO:root:Pickle file found, loading at 2017-07-31 14:35:56.093000
INFO:root:Loading finished at 2017-07-31 14:35:59.159000, took 3.066000 s
INFO:root:Cached 666994 forms for fast lookup
Getting tags for hares
('hf#1', set(['cj', 'snd', 'prim', 'para', 'md', 'sys', 'prs', 'v', 'np', 'sg', 'op']))
('hari#1', set(['na', 'mas', 'sg', 'gen']))
('hari#1', set(['na', 'mas', 'abl', 'sg']))
('hari#1', set(['na', 'fem', 'sg', 'gen']))
('hari#1', set(['na', 'fem', 'abl', 'sg']))
('hari#2', set(['na', 'mas', 'sg', 'gen']))
('hari#2', set(['na', 'mas', 'abl', 'sg']))
('hari#2', set(['na', 'fem', 'sg', 'gen']))
('hari#2', set(['na', 'fem', 'abl', 'sg']))
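The output above lists one (stem, tag set) pair per analysis, so the same stem can appear several times. A minimal sketch of grouping such pairs by stem follows. It assumes the results are plain (stem, set of tag strings) tuples as printed above; `SAMPLE_ANALYSES` and `group_by_stem` are hypothetical names used for illustration and are not part of sanskrit_parser.

```python
from collections import defaultdict

# A subset of the analyses printed above for 'hares'.
# Assumption: each analysis is a (stem, set of tag strings) pair.
SAMPLE_ANALYSES = [
    ("hf#1", {"cj", "snd", "prim", "para", "md", "sys", "prs", "v", "np", "sg", "op"}),
    ("hari#1", {"na", "mas", "sg", "gen"}),
    ("hari#1", {"na", "mas", "abl", "sg"}),
    ("hari#2", {"na", "fem", "sg", "gen"}),
]


def group_by_stem(analyses):
    """Collect the tag sets of each stem into a list, keeping first-seen order."""
    grouped = defaultdict(list)
    for stem, tags in analyses:
        grouped[stem].append(tags)
    return dict(grouped)


grouped = group_by_stem(SAMPLE_ANALYSES)
for stem, tag_sets in grouped.items():
    # Print each stem with the number of analyses found for it.
    print(stem, len(tag_sets))
```

Grouping this way makes it easier to see that, in this sample, 'hares' is analysed both as a verb form of one stem and as a genitive or ablative singular of another.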

class sanskrit_parser.util.inriaxmlwrapper.InriaXMLWrapper(logger=None)

Bases: sanskrit_parser.util.lexical_lookup.LexicalLookup

Class to interface with the INRIA XML database released by Prof. Gerard Huet (https://gitlab.inria.fr/huet/Heritage_Resources).

get_tags(word, tmap=True)

Return lexical tags of word

valid(word)

Return True if word is a valid pada
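To illustrate how the two methods relate, here is a toy stand-in for a lexical lookup. It is only a sketch under stated assumptions, not the library's implementation: `ToyLookup` and its dictionary of forms are hypothetical, and it assumes a word counts as valid exactly when at least one analysis is stored for it.

```python
class ToyLookup:
    """Hypothetical stand-in for a lexical lookup; not sanskrit_parser code."""

    def __init__(self, forms):
        # forms: dict mapping an inflected form to a list of (stem, tags) pairs
        self._forms = forms

    def get_tags(self, word):
        """Return the analyses stored for word, or None if the word is unknown."""
        return self._forms.get(word)

    def valid(self, word):
        """Return True if at least one analysis is stored for word."""
        return bool(self._forms.get(word))


# One analysis taken from the command-line output for 'hares'.
toy = ToyLookup({"hares": [("hari#1", {"na", "mas", "sg", "gen"})]})
print(toy.valid("hares"))  # True
print(toy.valid("xyz"))    # False
```

With the real class, the equivalent calls would be `db.valid(word)` and `db.get_tags(word)` on an `InriaXMLWrapper` instance.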
