Repository logo
Collections
Browse
  • English
  • हिंदी
Log In
  1. Home
  2. Publications
  3. Journal Article
  4. CONCORD: Enhancing COVID-19 Research with Weak-Supervision based Numerical Claim Extraction

Publication:
CONCORD: Enhancing COVID-19 Research with Weak-Supervision based Numerical Claim Extraction

Date

18-03-2024

Authors

Shah, Dhwanil
Shah, Krish
Jagani, Manan
Shah, Agam
Chaudhury, Bhaskar
Chaudhury, Bhaskar
Chaudhury, Bhaskar
Chaudhury, Bhaskar
Chaudhury, Bhaskar
Chaudhury, Bhaskar

Journal Title

Journal ISSN

Volume Title

Publisher

Research Square

Research Projects

Organizational Units

Journal Issue

Abstract

The COVID-19 Numerical Claims Open Research Dataset (CONCORD) is a comprehensive, open-source dataset that extracts numerical claims from academic papers on COVID-19 research. To extract numerical claims, a weak-supervision based model is employed, leveraging its white-box, explainable nature and advantages over transformer-based models in terms of computational and manual annotation costs. Labelling functions are used to programmatically generate labels, incorporating techniques like pattern matching, external knowledge bases, phrase matching, and third-party models. An aggregator function reconciles overlapping or contradictory labels. The weak-supervision model is evaluated against established baselines and transformer based models, achieving a weighted F1-score of 0.932 and micro F1-score of 0.930 in extracting numerical claims.While the weak-supervision model showcases superior performance compared to baseline models, it is observed that transformer-based models achieve comparable results.CONCORD, comprising around 200,000 numerical claims extracted from over 57,000 COVID-19 research articles, serves as a valuable tool for knowledge discovery and understanding the chronological developments in various research areas associated with COVID-19. In conclusion, CONCORD, alongside the weak-supervision methodology, offers researchers a valuable resource, enhancing advancements in COVID-19 research while highlighting the significant potential of weak-supervision models within the broader biomedical domain.

Description

Keywords

Citation

Dhwanil Shah, Krish Shah, Manan Jagani, Agam Shah, and Chaudhury, Bhaskar, "CONCORD: Enhancing COVID-19 Research with Weak-Supervision based Numerical Claim Extraction," Research Square, ISSN: 2693-5015, 18 Mar. 2024, doi: 10.21203/rs.3.rs-4076902/v1. [Preprint]

URI

https://ir.daiict.ac.in/handle/dau.ir/2066

Collections

Journal Article

Endorsement

Review

Supplemented By

Referenced By

Full item page

Research Impact

Metrics powered by PlumX, Altmetric and Dimensions

 
Quick Links
  • Home
  • Search
  • Research Overview
  • About
Contact

DAU, Gandhinagar, India

library@dau.ac.in

+91 0796-8261-578

Follow Us

© 2025 Dhirubhai Ambani University
Designed by Library Team