About Hanooman:
- It is a series of large language models (LLMs) that can respond in 11 Indian languages, such as Hindi, Tamil, and Marathi, with plans to expand to more than 20 languages.
- Capabilities: It is a multimodal AI tool that can generate text, speech, video, and more in multiple Indian languages.
- The models range in size from 1.5 billion to 40 billion parameters.
- Applications: It has been designed to work in four fields: health care, governance, financial services, and education.
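
To give a sense of what the parameter counts above mean in practice, the short sketch below estimates the memory needed just to hold the model weights. It assumes each parameter is stored as a 16-bit (2-byte) number, which is a common choice but is not stated in the source; the function name `weight_memory_gb` is made up for this illustration.

```python
def weight_memory_gb(num_parameters: float, bytes_per_parameter: int = 2) -> float:
    """Approximate memory for the weights alone, in gigabytes (10**9 bytes).

    Assumption: 2 bytes per parameter (16-bit precision). The real figure
    depends on how the model is actually stored and served.
    """
    return num_parameters * bytes_per_parameter / 1e9


smallest = weight_memory_gb(1.5e9)  # 1.5 billion parameters
largest = weight_memory_gb(40e9)    # 40 billion parameters

print(f"1.5 billion parameters: about {smallest:.0f} GB")
print(f"40 billion parameters: about {largest:.0f} GB")
```

Under this assumption the smallest model needs roughly 3 GB for its weights and the largest roughly 80 GB, which is why the smaller models are easier to run on modest hardware.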
What is the BharatGPT ecosystem?
- It is a research consortium led by IIT Bombay with seven other IITs. It is backed by the Department of Science and Technology, SML and Reliance Jio.
Key facts about large language models
- Large language models use deep learning techniques to process large amounts of text.
- They work by analysing this text to learn patterns in its structure and meaning.
- LLMs are ‘trained’ to identify meanings and relationships between words.
- In general, the more training data a model is fed, the better it becomes at understanding and producing text, although the quality of the data and the size of the model also matter.
- The training data usually comes from large datasets, such as Wikipedia, OpenWebText, and the Common Crawl corpus.
- These contain large amounts of text data, which the models use to understand and generate natural language.
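
The idea of learning relationships between words from text can be illustrated with a very small sketch. The example below is not how an LLM works internally: real LLMs use deep neural networks, whereas this toy simply counts which word follows which (a "bigram" model). The corpus and the function names `train_bigram_counts` and `predict_next` are invented for illustration.

```python
from collections import Counter, defaultdict

# Toy corpus, invented for illustration; real training data is vastly larger.
corpus = [
    "the model reads text",
    "the model learns patterns",
    "the model reads text again",
]


def train_bigram_counts(sentences):
    """Count, for every word, how often each other word follows it."""
    counts = defaultdict(Counter)
    for sentence in sentences:
        words = sentence.split()
        # Pair each word with the word that comes right after it.
        for current_word, next_word in zip(words, words[1:]):
            counts[current_word][next_word] += 1
    return counts


def predict_next(counts, word):
    """Return the most frequent follower of `word`, or None if none was seen."""
    followers = counts.get(word)
    if not followers:
        return None
    return followers.most_common(1)[0][0]


counts = train_bigram_counts(corpus)
print(predict_next(counts, "model"))  # "reads" follows "model" twice, "learns" once
print(predict_next(counts, "the"))    # "model" is the only word seen after "the"
```

Even in this toy, adding more sentences changes the counts and therefore the predictions, which mirrors the point above that the training data shapes what the model produces.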