State-of-the-Art in Nepali NLP

This project establishes a foundation for improved NLP applications in Nepali by systematically evaluating existing tools, models, and corpora.
Despite the growing importance of NLP, the Nepali language lacks robust NLP resources and tools. Existing NLP techniques often fall short on Nepali text because language-specific resources and models are scarce. This scarcity makes it difficult to develop NLP applications tailored to Nepali and hinders progress in areas such as named entity recognition, sentiment analysis, and intent classification for chatbots.
Our goal is to bridge the gap in Nepali NLP by conducting a thorough review of existing tools and techniques. We aim to evaluate the current state of the art in corpus development, language models, dependency parsing, and morphological analysis, and to identify areas for improvement and innovation specific to Nepali.
This project conducted a review of Nepali NLP tools, corpora, and models, systematically evaluating their strengths and identifying critical gaps in current approaches. The findings laid the groundwork for developing improved NLP applications tailored to the Nepali language, providing researchers and developers with clear direction for future work in Nepali language processing.