Repository logo
Collections
Browse
Statistics
  • English
  • हिंदी
Log In
New user? Click here to register.Have you forgotten your password?
  1. Home
  2. Theses and Dissertations
  3. M Tech Dissertations
  4. Shallow parsing of Gujarati text

Shallow parsing of Gujarati text

Files

200911008.pdf (382.21 KB)

Date

2011

Authors

Dave, Vidhi

Journal Title

Journal ISSN

Volume Title

Publisher

Dhirubhai Ambani Institute of Information and Communication Technology

Abstract

Shallow parsing is the process of assigning tag to minimal, non recursive phrase of the sentence. It is useful for many applications like question answering system, information retrieval where there is no need of full parsing. Gujarati is one of the main languages of India and 26th most spoken native language in the world. There are more than 50 million speakers of Gujarati language worldwide. Natural language processing of Gujarati is in its infancy. Now days there are many data available in Gujarati on websites but due to lack of resources it is hard for users to retrieve it efficiently. So, shallow parsing of Gujarati can make task easier for another tasks like machine translation, information extraction and retrieval. In this thesis, we have worked on the automatic annotation of Shallow Parsing of Gujarati. 400 sentences have been manually tagged. Different Machine Learning techniques namely Hidden Markov Model and Conditional Random Field have been used. We achieved good accuracy and it is similar to Hindi chunker even though resources available for Gujarati are very less. The best performance is achieved using CRF with contextual information and Part-of-speech tags.

Description

Keywords

Natural language processing, Linguistic analysis, Linguistics, Hidden Markov Model, Markov processes, Computational linguistics, Conditional Random Field, Morphology, Data processing, Grammar comparative and general, Syntax, Data processing

Citation

Dave, Vidhi (2011). Shallow parsing of Gujarati text. Dhirubhai Ambani Institute of Information and Communication Technology, viii, 34 p. (Acc.No: T00318)

URI

http://ir.daiict.ac.in/handle/123456789/355

Collections

M Tech Dissertations

Endorsement

Review

Supplemented By

Referenced By

Full item page
 
Quick Links
  • Home
  • Search
  • Research Overview
  • About
Contact

DAU, Gandhinagar, India

library@dau.ac.in

+91 0796-8261-578

Follow Us

© 2025 Dhirubhai Ambani University
Designed by Library Team