BharatGPT group unveils ‘Hanooman’
Feb. 25, 2024

Why in news?

  • The BharatGPT group — led by IIT Bombay along with seven other elite Indian engineering institutes — announced that it would launch its first ChatGPT-like service next month.
  • The group built the ‘Hanooman’ series of Indic language models in collaboration with Seetha Mahalaxmi Healthcare (SML).
    • The BharatGPT group is backed by Reliance Industries Ltd and the Department of Science and Technology.

What’s in today’s article?

  • Generative Pre-trained Transformers (GPTs)
  • Large Language Models (LLMs)
  • ChatGPT
  • Hanooman

Generative Pre-trained Transformers (GPTs)

  • GPTs are a type of large language model (LLM) that uses transformer neural networks to generate human-like text.
  • GPTs are trained on large amounts of unlabelled text data from the internet, enabling them to understand and generate coherent and contextually relevant text.
  • They can be fine-tuned for specific tasks such as language generation, sentiment analysis, language modelling, machine translation, and text classification.
  • GPTs use self-attention mechanisms to focus on different parts of the input text during each processing step.
  • This allows GPT models to capture more context and improve performance on natural language processing (NLP) tasks.
    • NLP is the ability of a computer program to understand human language, known as natural language, as it is spoken and written.
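
The self-attention step described above can be sketched in a few lines of Python. This is a deliberately simplified illustration, not the implementation used in any GPT model: it assumes each token's vector acts as its own query, key and value, whereas real models apply learned projection matrices, use several attention heads and mask future tokens. The names `softmax` and `self_attention` and the toy embeddings are hypothetical, chosen only for this example.

```python
import math

def softmax(scores):
    """Turn raw scores into weights that are positive and sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def self_attention(vectors):
    """Simplified self-attention: each token vector is query, key and value.

    Assumption: no learned projections, single head, no causal mask.
    """
    dim = len(vectors[0])
    outputs, all_weights = [], []
    for query in vectors:
        # Score the current token against every token in the sequence.
        scores = [sum(q * k for q, k in zip(query, key)) / math.sqrt(dim)
                  for key in vectors]
        weights = softmax(scores)
        # The output is the weighted average of all token vectors.
        mixed = [sum(w * vec[i] for w, vec in zip(weights, vectors))
                 for i in range(dim)]
        outputs.append(mixed)
        all_weights.append(weights)
    return outputs, all_weights

# Hypothetical 2-dimensional embeddings for a three-token input.
tokens = ["bank", "river", "money"]
embeddings = [[1.0, 1.0], [1.0, 0.0], [0.0, 1.0]]
outputs, weights = self_attention(embeddings)
for token, row in zip(tokens, weights):
    print(token, [round(w, 2) for w in row])
```

Each printed row shows how strongly one token attends to every token in the input. In this toy setup "river" gives more weight to "bank" than to "money", because their vectors overlap more.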

Large Language Models (LLMs)

  • Large language models use deep learning techniques to process vast amounts of text, learning its structure and meaning.
  • LLMs are trained to identify meanings and relationships between words.
  • In general, the more training data a model is fed, the better it becomes at understanding and producing text.
    • The training data usually comes from large datasets such as Wikipedia, OpenWebText, and the Common Crawl corpus.
    • The models use the text in these datasets to learn how to understand and generate natural language.
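
The idea of learning relationships between words from training text can be illustrated with a toy example. The sketch below is not how LLMs are actually trained (they use deep neural networks, not word counts); it is a simple bigram counter, assumed here only to show how patterns in text can drive a prediction of the next word. The function names and the tiny corpus are hypothetical.

```python
from collections import Counter, defaultdict

def train_bigram_model(text):
    """Count how often each word is followed by each other word."""
    words = text.lower().split()
    follows = defaultdict(Counter)
    for current, nxt in zip(words, words[1:]):
        follows[current][nxt] += 1
    return follows

def predict_next(model, word):
    """Return the most frequent follower of `word`, or None if unseen."""
    followers = model.get(word.lower())
    if not followers:
        return None
    return followers.most_common(1)[0][0]

# A tiny stand-in for a training corpus such as Wikipedia.
corpus = "the model reads text . the model learns patterns . the model reads more text"
model = train_bigram_model(corpus)
print(predict_next(model, "model"))
```

With more text, the counts cover more word pairs and the predictions improve, which loosely mirrors why larger training datasets tend to help language models.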

What is ChatGPT?

  • ChatGPT is a state-of-the-art natural language processing (NLP) model developed by OpenAI.
  • It is a variant of the popular GPT-3 (Generative Pre-trained Transformer 3) model, which has been trained on a massive amount of text data to generate human-like responses to a given input.
  • The chatbot's answers are intended to be technically sound yet free of jargon.
  • It can provide responses that sound like human speech, enabling natural dialogue between the user and the virtual assistant.

Hanooman

  • About
    • Hanooman is a series of large language models (LLMs) that can respond in 11 Indian languages, including Hindi, Tamil, and Marathi.
      • However, there are plans to expand to more than 20 languages.
    • It has been designed to work in four fields: health care, governance, financial services, and education.
  • Not just a chatbot
    • Notably, the series is not just a chatbot. It is a multimodal AI tool, which can generate text, speech, videos and more in multiple Indian languages.
    • One of the first customised versions is VizzhyGPT, an AI model fine-tuned for healthcare using reams of medical data.
    • The size of these AI models ranges from 1.5 billion to a whopping 40 billion parameters.
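
To give a rough sense of what these parameter counts mean in practice, the sketch below estimates the memory needed just to hold a model's weights. It assumes each parameter is stored in 16 bits (2 bytes), which is a common choice but not something the announcement states; the function name `weight_memory_gb` is hypothetical, and the estimate ignores everything else a running model needs.

```python
def weight_memory_gb(num_parameters, bytes_per_parameter=2):
    """Approximate memory for the weights alone, in gigabytes (10**9 bytes).

    Assumption: 2 bytes per parameter (16-bit storage).
    """
    return num_parameters * bytes_per_parameter / 1e9

# The smallest and largest model sizes mentioned in the article.
for params in (1.5e9, 40e9):
    print(f"{params / 1e9:g}B parameters -> about {weight_memory_gb(params):g} GB")
```

Under this assumption, the smallest model needs about 3 GB for its weights and the largest about 80 GB.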