Repository logo
Collections
Browse
Statistics
  • English
  • हिंदी
Log In
New user? Click here to register.Have you forgotten your password?
  1. Home
  2. Publications
  3. Journal Article
  4. CONCORD: Enhancing COVID-19 Research with Weak-Supervision based Numerical Claim Extraction

Publication:
CONCORD: Enhancing COVID-19 Research with Weak-Supervision based Numerical Claim Extraction

Date

18-03-2024

Authors

Shah, Dhwanil
Shah, Krish
Jagani, Manan
Shah, Agam
Chaudhury, Bhaskar
Chaudhury, Bhaskar
Chaudhury, Bhaskar
Chaudhury, Bhaskar
Chaudhury, BhaskarORCID 0000-0001-7618-3737
Chaudhury, Bhaskar

Journal Title

Journal ISSN

Volume Title

Publisher

Research Square

Research Projects

Organizational Units

Journal Issue

Abstract

The COVID-19 Numerical Claims Open Research Dataset (CONCORD) is a comprehensive, open-source dataset that extracts numerical claims from academic papers on COVID-19 research. To extract numerical claims, a weak-supervision based model is employed, leveraging its white-box, explainable nature and advantages over transformer-based models in terms of computational and manual annotation costs. Labelling functions are used to programmatically generate labels, incorporating techniques like pattern matching, external knowledge bases, phrase matching, and third-party models. An aggregator function reconciles overlapping or contradictory labels. The weak-supervision model is evaluated against established baselines and transformer based models, achieving a weighted F1-score of 0.932 and micro F1-score of 0.930 in extracting numerical claims.While the weak-supervision model showcases superior performance compared to baseline models, it is observed that transformer-based models achieve comparable results.CONCORD, comprising around 200,000 numerical claims extracted from over 57,000 COVID-19 research articles, serves as a valuable tool for knowledge discovery and understanding the chronological developments in various research areas associated with COVID-19. In conclusion, CONCORD, alongside the weak-supervision methodology, offers researchers a valuable resource, enhancing advancements in COVID-19 research while highlighting the significant potential of weak-supervision models within the broader biomedical domain.

Description

Keywords

Citation

Dhwanil Shah, Krish Shah, Manan Jagani, Agam Shah, and Chaudhury, Bhaskar, "CONCORD: Enhancing COVID-19 Research with Weak-Supervision based Numerical Claim Extraction," Research Square, ISSN: 2693-5015, 18 Mar. 2024, doi: 10.21203/rs.3.rs-4076902/v1. [Preprint]

URI

https://ir.daiict.ac.in/handle/dau.ir/2066

Collections

Journal Article

Endorsement

Review

Supplemented By

Referenced By

Full item page

Research Impact

Metrics powered by PlumX, Altmetric and Dimensions

 
Quick Links
  • Home
  • Search
  • Research Overview
  • About
Contact

DAU, Gandhinagar, India

library@dau.ac.in

+91 0796-8261-578

Follow Us

© 2025 Dhirubhai Ambani University
Designed by Library Team