Indian Biological Data Bank

Nov. 14, 2022

The ‘Indian Biological Data Bank’ has recently come up at the Regional Centre for Biotechnology in Faridabad.

About:

  • The government has for the first time set up a digitised repository where Indian researchers will store biological data from publicly funded research, reducing their dependency on American and European data banks.
  • The digitised data will be stored on a four-petabyte supercomputer called ‘Brahm’.
    • A petabyte equals 10,00,000 gigabytes (gb).
  • The government has mandated that data from all publicly funded research should be stored in this central repository.
  • So it will not only provide a platform to researchers to securely store their data within the country, it will also provide access to a large database of indigenous sequences for analyses.
  • At present, most Indian researchers depend on the European Molecular Biology Laboratory (EMBL) and National Center for Biotechnology Information databases for storing the biological data.
  • There are other smaller datasets available with some institutes, but those are not accessible to all. This will be the first national data repository, where the data will not only be submitted from across India but can be accessed by researchers from across India.

Database:

  • The bio-bank, which cost about Rs 85 crore to set up, currently accepts neucleotide sequences — the digitised genetic makeup of humans, plants, animals, and microbes.
  • There are now 200 billion base pair data in the bio-bank, including 200 human genomes sequenced under the ‘1,000 Genome Project’, which is an international effort to map the genetic variations in people.
  • The database also contains most of the 2.6 lakh Sars-Cov-2 genomes sequenced by the Indian Sars-CoV-2 Genomic Consortium (INSACOG).
  • The database will also store the 25,000 sequences of mycobacterium tuberculosis.
  • The database currently also stores the genomic sequences of crops such as rice, onion, tomatoes and mustard, among others.
  • The database currently offers two mechanisms for data submission to researchers.
    • One, open access where the data uploaded can be immediately used by other researchers from across the country and
    • two, controlled access where the data will not be openly shared for a number of years before being opened up to all.
  • The biobank also has a backup data ‘Disaster Recovery’ site at National Informatics Centre (NIC)-Bhubaneshwar.