Repository logo
Collections
Browse
Statistics
  • English
  • हिंदी
Log In
New user? Click here to register.Have you forgotten your password?
  1. Home
  2. Theses and Dissertations
  3. M Tech Dissertations
  4. Authorship attribution using quantitative analysis of languages.

Authorship attribution using quantitative analysis of languages.

Files

201111002.pdf (399.41 KB)

Date

2013

Authors

Mehta, Parth

Journal Title

Journal ISSN

Volume Title

Publisher

Dhirubhai Ambani Institute of Information and Communication Technology

Abstract

In this work we propose a Kullback Liebler Divergence based method for authorship attribution. We examine several quantitative techniques of authorship attribution that have gained importance over the time including the current state of the art Z-score based technique. First we examine in detail the drawbacks of the existing techniques and the scenario in which these techniques would not be much useful. Then we show how the K.L.D. based method with proper feature selection and normalization can achieve results comparable to the existing Z-score based technique, if not better. In this work we try to find the optimum values for number of terms, smoothing parameter value and the minimum number of texts required for creating an author profile so as to maximise the accuracy of the authorship attribution system. We evaluate the existing and proposed technique on a collection of 5039 articles from weekly supplements of Gujarat Samachar, a popular Gujarati newspaper, written by 40 distinct authors. Our experiments demonstrate that the proposed method performs equally well as the current state of the art method and performs much better than other existing techniques like Delta method or Chi-square based method. We also show that under retrains, like constraint on size of training set and distinguishing between two articles written by same author but under separate columns in the newspaper, our method performs much better even as compared to the state-of-art method.

Description

Keywords

Authorship, Style, Literary, Language, Authorship attribution System

Citation

Mehta, Parth (2013). Authorship attribution using quantitative analysis of languages.. Dhirubhai Ambani Institute of Information and Communication Technology, xiii, 40 p. (Acc.No: T00381)

URI

http://ir.daiict.ac.in/handle/123456789/418

Collections

M Tech Dissertations

Endorsement

Review

Supplemented By

Referenced By

Full item page
 
Quick Links
  • Home
  • Search
  • Research Overview
  • About
Contact

DAU, Gandhinagar, India

library@dau.ac.in

+91 0796-8261-578

Follow Us

© 2025 Dhirubhai Ambani University
Designed by Library Team