INTERNATIONAL JOURNAL OF NOVEL RESEARCH AND DEVELOPMENT International Peer Reviewed & Refereed Journals, Open Access Journal ISSN Approved Journal No: 2456-4184 | Impact factor: 8.76 | ESTD Year: 2016
Scholarly open access journals, Peer-reviewed, and Refereed Journals, Impact factor 8.76 (Calculate by google scholar and Semantic Scholar | AI-Powered Research Tool) , Multidisciplinary, Monthly, Indexing in all major database & Metadata, Citation Generator, Digital Object Identifier(DOI)
Representing words as vectors which encode their semantic properties is a vital component in natural language processing. Recent advances in distributional semantics have led to the rise of neural network-based models that use unsupervised learning to represent words as dense, distributed vectors, called word embedding. These embeddings have led to breakthroughs in performance in multiple natural language processing applications and hold the key to improving natural language processing for low-resource languages by helping machine learning algorithms learn patterns more easily from these richer representations of words, thereby allowing better generalisation from less data. In this paper, we train the CBOW model on more than 2 million Marathi sentences to create the first large-scale word embedding for the Marathi language.
We analyse the quality of the learned embedding by looking at the closest neighbours to different words in the vector space and find that they capture a high degree of syntactic and semantic similarity between words. We evaluate this quantitatively by experimenting with two approaches namely training the model without using morphological analysis on the given dataset and by applying morphological analysis.
Keywords:
Marathi Word Embeddings, Word Vectors, CBOW, Morphological Analysis, PMI, Skip-gram, Glove
Cite Article:
"Semantic Word Embeddings For Marathi Language ", International Journal of Novel Research and Development (www.ijnrd.org), ISSN:2456-4184, Vol.7, Issue 10, page no.994-998, October-2022, Available :http://www.ijnrd.org/papers/IJNRD2210122.pdf
Downloads:
000118759
ISSN:
2456-4184 | IMPACT FACTOR: 8.76 Calculated By Google Scholar| ESTD YEAR: 2016
An International Scholarly Open Access Journal, Peer-Reviewed, Refereed Journal Impact Factor 8.76 Calculate by Google Scholar and Semantic Scholar | AI-Powered Research Tool, Multidisciplinary, Monthly, Multilanguage Journal Indexing in All Major Database & Metadata, Citation Generator
Facebook Twitter Instagram LinkedIn