Journal Article
Permanent URI for this collectionhttps://ir.daiict.ac.in/handle/123456789/37
Browse
18 results
Search Results
Publication Metadata only Exploring Topic Trends in COVID-19 Research Literature using Non-Negative Matrix Factorization(IEEE, 12-06-2025) Patel, Divya; Parikh, Vansh; Patel, Om; Shah, Agam; Chaudhury, Bhaskar; DA-IICT, GandhinagarPublication Metadata only Understanding and Attaining an Investment Grade Rating in the Age of Explainable AI(Springer, 18-08-2024) Makwana, Ravi; Bhatt, Dhruvil; Delwadia, Kirtan; Shah, Agam; Chaudhury, Bhaskar; DA-IICT, Gandhinagar; Makwana, Ravi (201801461); Bhatt, Dhruvil (201801056); Delwadia, Kirtan (201801020)Specialized agencies issue corporate credit ratings to evaluate the creditworthiness of a company, serving as a crucial financial indicator for potential investors. These ratings offer a tangible understanding of the risks associated with the credit investment returns of a company. Every company aims to achieve a favorable credit rating, as it enables them to attract more investments and reduce their cost of capital. Credit rating agencies typically employ unique rating scales that are broadly categorized into investment-grade or non-investment-grade (junk) classes. Given the extensive assessment conducted by credit rating agencies, it becomes a challenge for companies to formulate a straightforward and all-encompassing set of rules which may help to understand and improve their credit rating. This paper employs explainable AI, specifically decision trees, using historical data to establish an empirical rule on financial ratios. The rule obtained using the proposed approach can be effectively utilized to understand as well as plan and attain an investment-grade rating. Additionally, the study investigates the temporal aspect by identifying the optimal time window for training data. As the availability of structured data for temporal analysis is currently limited, this study addresses this challenge by creating a large and high-quality curated dataset. This dataset serves as a valuable resource for conducting comprehensive temporal analysis. Our analysis demonstrates that the empirical rule derived from historical data, yields a high precision value, and therefore highlights the effectiveness of our proposed approach as a valuable guideline and a feasible decision support system.Publication Metadata only Deep learning assisted microwave-plasma interaction based technique for plasma density estimation(IOP Science, 01-08-2024) Ghosh, Pratik; Chaudhury, Bhaskar; Purohit, Shishir; Joshi, Vishv; Kothari, Ashray; Shetranjiwala, Devdeep; Chaudhury, Bhaskar; Chaudhury, Bhaskar; Chaudhury, Bhaskar; Chaudhury, Bhaskar; Chaudhury, Bhaskar; DA-IICT, Gandhinagar; Ghosh, Pratik (201721010); Joshi, Vishv (201901453); Kothari, Ashray (201901457); Shetranjiwala, Devdeep (202001150)The electron density is a key parameter to characterize any plasma. Most of the plasma applications and research in the area of low-temperature plasmas (LTPs) are based on the accurate estimations of plasma density and plasma temperature. The conventional methods for electron density measurements offer axial and radial profiles for any given linear LTP device. These methods have major disadvantages of operational range (not very wide), cumbersome instrumentation, and complicated data analysis procedures. The article proposes a deep learning (DL) assisted microwave-plasma interaction-based non-invasive strategy, which can be used as a new alternative approach to address some of the challenges associated with existing plasma density measurement techniques. The electric field pattern due to microwave scattering from plasma is utilized to estimate the density profile. The proof of concept is tested for a simulated training data set comprising a low-temperature, unmagnetized, collisional plasma. Different types of symmetric (Gaussian-shaped) and asymmetrical density profiles, in the range 1016�1019 m?3, addressing a range of experimental configurations have been considered in our study. Real-life experimental issues such as the presence of noise and the amount of measured data (dense vs sparse) have been taken into consideration while preparing the synthetic training data-sets. The DL-based technique has the capability to determine the electron density profile within the plasma. The performance of the proposed DL-based approach has been evaluated using three metrics- structural similarity index, root mean square logarithmic error, and mean absolute percentage error. The obtained results show promising performance in estimating the 2D radial profile of the density for the given linear plasma device and affirms the potential of the proposed machine learning-based approach in plasma diagnostics.Publication Metadata only Automated labelling and correlation analysis of diagnostic signals from ADITYA tokamak for developing AI-based disruption mitigation systems(Taylor and Francis, 09-08-2024) Agarwal, J; Chaudhury, Bhaskar; Jakhar, S; Shah, N; Arora, S; Katrodia, D; Sharma, M; DA-IICT, GandhinagarAI/ML-based data-driven methodologies are becoming increasingly effective in understanding and predicting plasma disruption in tokamaks by identifying critical signatures present in various diagnostic signals obtained from tokamaks. A high-performance ML-based disruption predictor requires large accurately labelled data. Until now, plasma shots from the ADITYA tokamak have primarily been classified (labelled) as disruptive or non-disruptive manually. Here, we present three computational techniques, namely the Sorted-array approach, the Interval comparison approach and the Threshold-Straight line method for automatic labelling of the ADITYA shots as disruptive or non-disruptive based on the plasma current dropdown time. Statistical analysis and comparison between automatic labelling and manual labelling indicate the promising potential of the proposed techniques. A correlation analysis is also conducted by incorporating plasma diagnostics such as Plasma current, Loop voltage, Bolometer, Mirnov, Hard X-ray, Soft�X-ray, Radiation from Hydrogen-alpha, ionised oxygen and ionised carbon. This comprehensive study offers valuable insights into diverse physical phenomena associated with disruptions. Furthermore, correlation analysis based on current quench time highlights the significance of different diagnostics in providing distinct signatures related to plasma disruption. The insights obtained from this work can play a pivotal role in advancing the development of data-driven disruption prediction systems for ADITYA tokamak.Publication Metadata only CONCORD: Enhancing COVID-19 Research with Weak-Supervision based Numerical Claim Extraction(Research Square, 18-03-2024) Shah, Dhwanil; Shah, Krish; Jagani, Manan; Shah, Agam; Chaudhury, Bhaskar; Chaudhury, Bhaskar; Chaudhury, Bhaskar; Chaudhury, Bhaskar; Chaudhury, Bhaskar; Chaudhury, Bhaskar; DA-IICT, Gandhinagar; Shah, Dhwanil (201901450); Shah, Krish (201901465); Jagani, Manan (201901295)The COVID-19 Numerical Claims Open Research Dataset (CONCORD) is a comprehensive, open-source dataset that extracts numerical claims from academic papers on COVID-19 research. To extract numerical claims, a weak-supervision based model is employed, leveraging its white-box, explainable nature and advantages over transformer-based models in terms of computational and manual annotation costs. Labelling functions are used to programmatically generate labels, incorporating techniques like pattern matching, external knowledge bases, phrase matching, and third-party models. An aggregator function reconciles overlapping or contradictory labels. The weak-supervision model is evaluated against established baselines and transformer based models, achieving a weighted F1-score of 0.932 and micro F1-score of 0.930 in extracting numerical claims.While the weak-supervision model showcases superior performance compared to baseline models, it is observed that transformer-based models achieve comparable results.CONCORD, comprising around 200,000 numerical claims extracted from over 57,000 COVID-19 research articles, serves as a valuable tool for knowledge discovery and understanding the chronological developments in various research areas associated with COVID-19. In conclusion, CONCORD, alongside the weak-supervision methodology, offers researchers a valuable resource, enhancing advancements in COVID-19 research while highlighting the significant potential of weak-supervision models within the broader biomedical domain.Publication Metadata only Efficient Dynamic Mesh Refinement Technique for Simulation of HPM Breakdown-Induced Plasma Pattern Formation(IEEE, 01-01-2023) Ghosh, Pratik; Chaudhury, Bhaskar; DA-IICT, Gandhinagar; Ghosh, Pratik (201721010)Numerical simulation of the complex plasma dynamics associated with high power, high-frequency microwave breakdown at high pressures, leading to the formation of filamentary plasma structures such as self-organized plasma arrays, is a computationally challenging problem. The widely used 2-D electromagnetic (EM)�plasma fluid model, which accurately captures the experimental observations, requires a runtime of several days to months to simulate standard problems due to stringent numerical requirements in terms of cell size and time step. This article presents a self-aware mesh refinement (MR) algorithm that uses a coarse mesh and a fine mesh that dynamically expand based on the plasma profile topology to resolve the sharp gradients in the�E�-fields and plasma density in the breakdown region. The dynamic MR (DMR) technique is explained in detail, and its performance has been evaluated using a standard benchmark microwave breakdown problem. We observe a speedup of 8 (of the order of�O(r3)�, when the refinement factor (�r�) is 2) compared with a traditional single uniform fine-mesh-based simulation. The technique is scalable and performs better when the problem size increases. We also present a comprehensive spatio-temporal visual analysis to explain the complex physics of high-power microwave (HPM) breakdown, leading to self-organized plasma filaments as an application of the DMR technique.Publication Metadata only Synthetic data generation using generative adversarial network for tokamak plasma current quench experiments(John Wiley and Sons, 07-01-2023) Dave, Bhrugu; Patel, Sarthak; Shivani, Rishi; Purohit, Shishir; Chaudhury, Bhaskar; DA-IICT, Gandhinagar; Dave, Bhrugu (201801401); Patel, Sarthak (201801435); Shivani, Rishi (201801073)Deep learning models for identification and subsequent mitigation of tokamak plasma disruption have recently shown great promise for reliable predictions for machines other than the one on which it has been trained. The performance of such artificial intelligence (AI)/machine learning (ML) models strongly depends on the training data. Considering the sparse availability of universal high quality data underscores the requirement for synthetic data for the training of the AI/ML models. Synthetic data generation methods reported in the current literature have limitations in terms of quantity, diversity and preserving the temporal dynamics of the experimental seed data (SD). The article presents generative adversarial networks based procedure capable enough to generate unlimited device-independent temporal evolution of tokamak plasma current. The synthetic data improves with the employment of the classified SD while retaining the characteristics of the original data. The procedure offers a substantial volume of synthetic data with a very impressive diversity, thereby ensuring the requirements for successful AI/ML model training.Publication Metadata only Parallel Fast Multipole Method accelerated FFT on HPC clusters(Elsevier, 07-01-2021) Mehta, Chahak; Karthi, Amarnath; Jetly, Vishrut; Chaudhury, Bhaskar; Chaudhury, Bhaskar; Chaudhury, Bhaskar; Chaudhury, Bhaskar; Chaudhury, Bhaskar; Chaudhury, Bhaskar; DA-IICT, Gandhinagar; Mehta, Chahak (201501422); Karthi, Amarnath (201501005); Jetly, Vishrut (201601449)With increasing sizes of distributed systems, there comes an increased risk of communication bottlenecks. In the past decade there has been a growing interest in communication-avoiding algorithms. The distributed memory Fast Fourier Transform is an important algorithm which suffers from major communication bottlenecks. In this work, we take a look at an existing communication-avoiding algorithm FMM-FFT, an alternative to FFT which utilizes the Fast Multipole Method (FMM) to reduce communications to a single all-to-all communication. We present a detailed implementation of FMM-FFT relying on modern libraries and demonstrate it on two distinct distributed memory architectures notably a traditional Intel Xeon based HPC cluster and then a Beowulf cluster. We show that while the FMM-FFT is significantly slower than FFT on the traditional HPC cluster, on the Beowulf cluster it outperforms standard FFT, consistently getting speedups of 1.5x or more against FFTW. We then proceed to show how the communication to computation cost metric is important and useful in explaining the performance results of FMM-FFT against standard FFT. The source code pertaining to this work is being made publicly available under a permissive open source licence at Github.Publication Metadata only An Application of Machine Learning for Plasma Current Quench Studies via Synthetic Data Generation(Elsevier, 10-01-2021) Dalsani, Niharika; Patel, Zeel; Purohit, Shishir; Chaudhury, Bhaskar; Chaudhury, Bhaskar; Chaudhury, Bhaskar; Chaudhury, Bhaskar; Chaudhury, Bhaskar; Chaudhury, Bhaskar; DA-IICT, Gandhinagar; Dalsani, Niharika (201701438); Patel, Zeel (201701443)Electromagnetic�forces, thermal loads, and radiation loads experienced by the in-vessel components or vacuum vessels at the time of the�tokamak�plasma current�quench (CQ) significantly affect the overall plasma device�s health. Thus the mitigation of plasma CQ is of paramount importance, which requires a proper identification of the disruption precursors. Using new Machine Learning (ML) and Artificial Intelligence (AI) approaches, it is possible to identify disruption precursors; however, such approaches require training the ML models. This training of models requires a massive amount of experimental data, which sometimes may not be available for different tokamaks. This necessitates the need for accurate synthetic disruption data generation presenting different types of the CQ profiles observed experimentally. A novel approach for synthetic CQ data generation, considering the experimental aspect of the CQ profile shape for a wide range of�tokamak�plasma discharges, is designed to train ML/AI models. The trained model results are also elaborated here, which includes identifying current before disruption and classification of CQ profile types in time-space.Publication Metadata only Principal component analysis based construction and evaluation of cryptocurrency index(Elsevier, 01-01-2021) Shah, Agam; Chauhan, Yagnesh; Chaudhury, Bhaskar; Chaudhury, Bhaskar; Chaudhury, Bhaskar; Chaudhury, Bhaskar; Chaudhury, Bhaskar; Chaudhury, Bhaskar; DA-IICT, Gandhinagar; Shah, Agam (201501099); Chauhan, Yagnesh (201501206)Decentralized nature of�cryptocurrencies, irrational cryptocurrency valuations and severe price volatility in�cryptocurrency market�makes it a formidable task for investors to pick individual coins, and rather investors would prefer to invest money on the entire cryptocurrency market accurately represented by a cryptocurrency index. This paper proposes the design of a�Principal Component�Analysis based methodology to construct a dynamic cryptocurrency index that accurately tracks the movement of the entire cryptocurrency market. Our analysis on real market data shows why first component derived from PCA is sufficient to construct the cryptocurrency index and how to determine the number of constituents while building the index. Proposed PCA based tool, tested on actual historical cryptocurrency data, takes into account the changing dynamics of the cryptocurrency market by regularly shifting the number of constituents as well as the weights and is able to capture the evolving pattern in the cryptocurrency market. The proposed index has been validated by three-factor�pricing�model, consisting of market, size and momentum factors. Subsequently, the proposed index has been compared with other existing indexes based on the results of three-factor model. In summary, the paper presents a robust mathematical model for construction of a dynamic cryptocurrency index that can be used as a tool to analyze the�return on investments�as well as to study the fluctuations present in the cryptocurrency market.