language model – Dataconomy

Chinchilla AI is coming for the GPT-3’s throne

Eray Eliaçık — Thu, 12 Jan 2023 13:32:31 +0000

Chinchilla AI is yet another example of AI language model, claimed to outperform GPT-3. Yes, you heard right. The engine behind the ChatGPT is outperformed by DeepMind’s new language model. The news spread rapidly, and soon everyone wondered: “What is Chinchilla AI?” Are you one of them? You came to the right place. As always, we continue to share with you the latest trends in the AI world.

We have already explained some of the best AI tools like ChatGPT, DALL-E 2, Stable Diffusion, and Lensa AI. Now it’s Chinchilla AI’s turn. Keep reading and find out if it is really better than GPT-3.

What is Chinchilla AI?

DeepMind by Chinchilla AI is a popular choice for a large language model, and it has proven itself to be superior to its competitors. In March of 2022, DeepMind released Chinchilla AI. It functions in a manner analogous to that of other large language models such as GPT-3 (175 parameters), Jurassic-1 (178B parameters), Gopher (280B parameters), and Megatron-Turing NLG (300 parameters) (530B parameters). Nonetheless, Chinchilla AI’s main selling point is that it can be created for the same anticipated cost as Gopher, and yet it employs fewer parameters with more data to provide, on average, 7% more accurate results than Gopher.

For a FLOP budget, previous work over-allocated parameters at the expense of training tokens. Chinchilla and Gopher use the same training compute; yet Chinchilla is trained on 4x more tokens and is 4x smaller making it cheaper to use downstream. https://t.co/RepU03NJ91 2/3 pic.twitter.com/kBAavQ3rTC

— Google DeepMind (@GoogleDeepMind) April 12, 2022

Chinchilla outperforms Gopher (280B), GPT-3 (175B), Jurassic-1 (178B), and Megatron-Turing NLG on a wide array of downstream evaluation tasks (530B). It considerably simplifies downstream utilization because it requires much less computer power for inference and fine-tuning.

To streamline operations and improve decision-making, companies can leverage Chinchilla AI. It paves the way for companies to create and release AI-powered applications, enhancing digital product functionality.

Based on the training of previously employed language models, it has been determined that if one doubles the model size, one must also have twice the number of training tokens. This hypothesis has been used to train Chinchilla AI by Deepmind. Similar to Gopher in terms of cost, Chinchilla AI has 70B parameters and four times as much data. Chinchilla outperforms Gopher (280B), Megatron-Turning NLG (530B), Jurassic-1 (178B), and GPT-3 across the board in a plethora of evaluation tasks, achieving quite remarkable results (175B).

Image courtesy: Chinchilla AI

Chinchilla AI has an average accuracy of 67.5% on the MMLU benchmark, which is 7% higher than Gopher’s performance. Incredibly, Chinchilla AI outperforms more traditional, massive language models in terms of accuracy. Chinchilla requires far less processing power for inference and tweaking, which greatly benefits downstream applications.

Unfortunately, there is currently no way for the general public to use Chinchilla AI DeepMind because it is still in the testing phase. Once released, Chinchilla AI will be useful for developing various artificial intelligence tools, such as chatbots, virtual assistants, and predictive models. Until then, we rely heavily on the Tweets sent by DeepMind researchers.

DeepMind’s Chinchilla AI is a game-changer with the potential to improve businesses’ bottom lines and the quality of their customer’s experiences. Many operations can be automated and improved with the help of Chinchilla AI.

Early bird benefits in AI adoption are about to end. Be hurry!

Chinchilla AI features

When it comes to artificial intelligence (AI) technology, the computing budget is usually the limiting component. In the end, the size of the model and the quantity of training tokens will be determined by how much money the company can spend on more powerful technology. Chinchilla AI has some capabilities to help with this problem:

Fixed model size: The developers at DeepMind started with a family of fixed model sizes (70M-16B) and tweaked the total number of training tokens to optimize performance (4 variations). The optimal pairing was then determined for each available computing resource. A model with the same computational power as Gopher’s training would contain 1.5T tokens and 67B parameters, as calculated by this approach.
Curves for isoFLOP: DeepMind’s engineers played around with different model sizes while keeping the available computing power constant. A compute-optimal model with 63 billion parameters and 1.4 trillion tokens may be trained using the same amount of computational power as Gopher using this approach.
Creating a parametric loss function: Applying what they learned from the first two approaches, DeepMind’s engineers characterized the losses as parametric functions of the model size and token count. In terms of computing, the compute-optimal model trained with this approach would have 40B parameters, which is on par with Gopher.

Image courtesy: Deepmind

Compared to all major language models established in the recent two years that exhibited SOTA results, Chinchilla’s performance is noteworthy not just because of the improvement but also because the model is smaller. Many specialists in the field of artificial intelligence have argued that businesses and academic institutions are wasting time and money by focusing on expanding the size of their models instead of finding ways to better utilize the resources and parameters already at their disposal.

Chinchilla is a revolutionary improvement in both performance and efficiency.

How to use Chinchilla by Deepmind?

Since we’ve covered the basics of Chinchilla AI, we’ll answer your questions on how to use it, but we have some terrible news first. Unfortunately, it is not available to the public at the time of writing. Eventually, in the following months, we will be able to use Chinchilla AI and update this part. After its use is made public, You can do these with Chinchilla AI:

Chinchilla AI is an AI platform for process automation and improved business judgment. It helps companies create and release AI-driven applications that enhance the functionality of their digital products.
Chinchilla AI can be used to create chatbots. As the name implies, chatbots are computer programs that can simulate human dialogue. They are typically implemented to streamline selling or customer service processes.
With Chinchilla AI, you can make your own chatbot without needing to learn how to code. It may be used to build a chatbot for usage in places like Discord, your website, Facebook Messenger, and more.
Chatbots, virtual assistants, predictive models, and other AI-powered applications can be created using Chinchilla AI. When it comes to creating AI-powered applications, Chinchilla AI is ideal for firms that need to move swiftly.
The application of Chinchilla AI allows for the developing of interactive characters in video games. Its user-friendly interface and robust set of features may be used to program artificial intelligence for games of varying complexity, from arcade classics to high-stakes strategic simulations. When paired with other AI technologies, Chinchilla AI can also be used to create 3D printable models.

Check out the research paper for detailed information.

Are you wondering how your room will be in cyberpunk style? Try Interior AI

Chatbot alternatives: Things like ChatGPT

If you’ve come this far, you must be interested in text-to-text AI tools. The following resources could be useful to you:

Welcome to the AI-driven world

Do you wonder about the effects of artificial intelligence in everyday life? Almost every day, a new tool, model, or feature pops up and changes our lives, like ChatGPT, and we have already reviewed some of the best ones:

Text-to-text AI tools
- Chinchilla
- Notion AI
- Chai
- NovelAI
- Caktus AI
- AI Dungeon

Text-to-image AI tools
Other AI tools

Do you want more tools? Check out the best free AI art generators.

NLLB-200: Meta AI provides a 44% improvement in translation quality

Kerem Gülen — Fri, 15 Jul 2022 08:45:15 +0000

The new Meta AI model called NLLB-200 can translate 200 languages and improves quality by 44 percent on average, and has demonstrated tremendous potential.

The most widely used languages have been covered by translation applications for a while. Even if they don’t provide an exact translation, it’s typically close enough for the native speaker to comprehend.

NLLB-200 translates 200 different languages with accurate results

However, there remain hundreds of millions of people who continue to experience lousy translation services in areas with numerous languages, such as Africa and Asia.

“To help people connect better today and be part of the metaverse of tomorrow, our AI researchers created No Language Left Behind (NLLB), an effort to develop high-quality machine translation capabilities for most of the world’s languages,” Meta stated in a press release. “Today, we’re announcing an important breakthrough in NLLB: We’ve built a single AI model called NLLB-200, which translates 200 different languages with results far more accurate than what previous technology could accomplish.”

The metaverse strives to have no boundaries. Translation services will need to provide correct translations fast in order to make that possible. Also, did you know Google AI Pathways Language Model can explain a joke?

NLLB-200 reportedly achieved a 44 percent higher “quality” translation score.

“As the metaverse begins to take shape, the ability to build technologies that work well in a wider range of languages will help to democratize access to immersive experiences in virtual worlds,” the company explained.

In comparison to earlier AI research, NLLB-200 reportedly achieved a 44 percent higher “quality” translation score. The translations produced by NLLB-200 were more precise than human translations for some languages with African and Indian roots.

How Meta AI achieved these results?

Most machine translation (MT) models available today only function with mid-to-high-resource languages, leaving the majority of low-resource languages behind. Meta AI researchers are creating three important AI developments to overcome this problem.

NLLB-200 were more precise than human translations for some languages with African and Indian roots.

To assess and enhance NLLB-200, Meta produced a dataset dubbed FLORES-200. Researchers can evaluate FLORES-200’s performance “in 40,000 different language directions” thanks to the dataset.

Developers are welcome to contribute to both NLLB-200 and FLORES-200 in order to expand on Meta’s work and enhance their own translation tools.

For academics and nonprofit organizations that want to use NLLB-200 for worthwhile purposes related to sustainability, food security, gender-based violence, education, or other areas that support UN Sustainable Development Goals, Meta has a pool of grants totaling up to $200,000.

But not everyone is quite sold on Meta’s most recent project.

“It’s worth bearing in mind, despite the hype, that these models are not the cure-all that they may first appear. The models that Meta uses are massive, unwieldy beasts. So, when you get into the minutiae of individualized use-cases, they can easily find themselves out of their depth – overgeneralized and incapable of performing the specific tasks required of them,” stated CTO of Iris.ai, Victor Botev.

You can try out a demo of NLLB-200.

“Another point to note is that the validity of these measurements has yet to be scientifically proven and verified by their peers. The datasets for different languages are too small, as shown by the challenge in creating them in the first place, and the metric they’re using, BLEU, is not particularly applicable,” he added.

You can try out a demo of NLLB-200 by visiting this link. “We’ve created a demo that uses the latest AI advancements from the No Language Left Behind project to translate books from their languages of origin such as Indonesian, Somali, and Burmese into more languages for readers – with hundreds available in the coming months. With this AI tool, families can now read stories together from around the world in a language that works for them,” Meta stated. Recently, we’ve covered that P-computers are the future for developing efficient AI and ML systems. These systems are also critical when it comes to creating efficient AI models.