作者
Ajay Sharma,Tarun Kumar Pal,Utkarsha Naithani,Gaurav Gupta,Varun Jaiswal
摘要
Current technologies are much more advanced, and they greatly outpace storage, organizing, analyzing, and extracting capabilities. The rapid and cost-effective generation of biological data has led to a heap of "big biological data." A variety of bioinformatics fields, including drug omics, proteomics, immunological informatics, cheminformatics, and structural bioinformatics, are covered in the current repertoire. We are discussing sophisticated ways to effectively handle data from biological experiments, including big data approaches, software, computational tools, and biological databases. Big data, or very large volumes of data generated by a variety of biological experiments, are produced using NGS technology and are crucial for genomic and biomedical research. In NGS experiments, there is a need for the creation of novel computer algorithms that satisfy the analysis requirements of high-performance computing. The authors have reviewed current techniques for processing large datasets in PDB (Public Domain Database), NCBI (IEDB), DDBJ (Document-Based Database), and other databases using technologies such as Apache Pig (No SQL), H-base (Hibernication), and Hive (Map-Reduce Programming Framework). Using a new database management system (DBMS) is an approach to deal with the problem of storing big data files for efficient search queries in a reasonable amount of time. The healthcare industry generates a large amount of data for various purposes, such as the research and development of medications, clinical operations, and care and analysis of patient profiles. A report claimed that in 2011, the amount of data related to healthcare in the United States exceeded 150 exabytes. Therefore, it is critical that researchers comprehend the significance of big data technologies. Personalized healthcare is being developed as a result of the rapid development of applications for mobile healthcare and additional monitoring tools, which aid the analysis of analytics data generated by these tools regarding patient profiles.