11 Major AI Developments: RT-2 to '100X GPT-4'

March 17, 2024
Author: Big Y

GPT-4 and the Latest Developments in AI

In the world of artificial intelligence, there were 11 major developments this week, and each one probably deserves a full video. But just for you guys, I'm going to try to cover it all here. From scaling GPT-4 100x to Stable Beluga 2 and Senate testimony, let's dive into the latest developments in AI.

RT-2: Understanding the World

RT-2, which as far as I'm concerned could have been called R2-D2 or C-3PO, is starting to understand the world. In this demonstration, RT-2 was asked to pick up an extinct animal, and it picked up the dinosaur. Not only is it manipulating an object that it had never seen before, but it's also making a logical leap that, for me, is extremely impressive. It had to have the language understanding to link "extinct animal" to this plastic dinosaur.

Robots at Google and elsewhere used to work by being programmed with a specific, highly detailed list of instructions. Now, instead of being programmed for specific tasks one by one, robots can use an AI language model or, more specifically, a vision-language model. The vision-language model is pre-trained on web-scale data, not just text but also images, and then fine-tuned on robotics data. The result is what Google calls a vision-language-action model, which can control a robot. This enables it to understand tasks like "pick up the empty soda can." In a scene reminiscent of 2001: A Space Odyssey, Robotics Transformer 2 was given the task of hammering a nail, and it picked up the rock to use as a hammer. Because its brain is part language model, techniques like chain of thought actually improve performance: when it was made to output an intermediate plan before performing actions, it got a lot better at the tasks involved.
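A rough sketch of the plan-then-act idea described above: the model emits text containing a short plan followed by action tokens, and a thin wrapper splits the two and maps the integer tokens back to continuous motor commands. The `Plan: ... Action: ...` output format, the 256-bin discretization, the [-1, 1] value range, and the helper names are all assumptions chosen for illustration, not details taken from this article.

```python
import re
from dataclasses import dataclass

NUM_BINS = 256          # assumed number of discrete bins per action dimension
LOW, HIGH = -1.0, 1.0   # assumed range of each continuous action value


@dataclass
class PlannedAction:
    plan: str
    values: list[float]


def detokenize(token: int) -> float:
    """Map an integer bin index in [0, NUM_BINS - 1] back to a value in [LOW, HIGH]."""
    if not 0 <= token < NUM_BINS:
        raise ValueError(f"token {token} outside [0, {NUM_BINS - 1}]")
    return LOW + (HIGH - LOW) * token / (NUM_BINS - 1)


def parse_model_output(text: str) -> PlannedAction:
    """Split a 'Plan: ... Action: ...' string into the plan and decoded action values."""
    match = re.search(r"Plan:\s*(.*?)\s*Action:\s*([\d\s]+)", text, flags=re.DOTALL)
    if match is None:
        raise ValueError("expected 'Plan: ... Action: ...' in model output")
    plan = match.group(1).rstrip(". ")
    tokens = [int(t) for t in match.group(2).split()]
    return PlannedAction(plan=plan, values=[detokenize(t) for t in tokens])


# Hypothetical model output for the instruction "hammer the nail".
output = "Plan: pick up the rock. Action: 0 255 51"
result = parse_model_output(output)
print(result.plan)
print(result.values)
```

The point of the sketch is that the plan and the action live in the same generated string, so asking the model to write the plan first costs nothing architecturally: the action tokens are simply conditioned on the plan text that precedes them.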

Mustafa Suleyman's Prediction

In an interview with Barron's, Mustafa Suleyman, the head of Inflection AI, said that they are about to train models that are 10 times larger than the cutting-edge GPT-4 and then 100 times larger than GPT-4. That, he said, is what things look like over the next 18 months. He went on to say that it's going to be absolutely staggering and eye-wateringly different. This isn't idle speculation: Inflection AI has 22,000 H100 GPUs, and because of a leak, Suleyman would know the approximate size of GPT-4. Knowing everything he knows, he says he's going to train a model 10 to 100 times larger than GPT-4 in the next 18 months.
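To put that timeline in perspective, here is a back-of-the-envelope calculation of the doubling time it implies. It assumes steady exponential growth and that the 100x model lands at the end of the 18-month window. The quote does not say whether "larger" means parameters or training compute, so this only describes the ratio, and the function name is just for illustration.

```python
import math


def doubling_time_months(growth_factor: float, months: float) -> float:
    """Months per doubling if size grows by `growth_factor` over `months` at a steady exponential rate."""
    return months / math.log2(growth_factor)


# Assumption: the 100x model arrives at the end of the 18-month window.
print(round(doubling_time_months(100, 18), 2))  # 2.71
```

Under those assumptions, model size would have to double roughly every 2.7 months.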

Runway Gen-2

The rapid development of AI video is also worth noting. This is Runway Gen-2, and let me show you 16 seconds of Barbie Oppenheimer, which Andrej Karpathy calls filmmaking 2.0.

Sam Altman's Research

Sam Altman and his researchers made it clear in 10 different ways that they pray to the god of scale. They want to keep going bigger to see where this paradigm leads. They think that Google is going to unveil Gemini within months, and they say, "We are basically always prepping for a run," and that's a reference to GPT-5.

Real-Time Speech Transcription

Real-time speech transcription for deaf people is now available for less than $100. Subtitles for the real world are now possible using a device that lets you see captions for everything someone says, in your field of view and in real time, while still getting a good sense of the speaker's lips, their environment, and everything else around them.

AI Voices Can Whisper

AI voices can now whisper. Ladies and gentlemen, hold on to your hats because this is one bizarre sight.

Stable Beluga 2

Stable Beluga 2 is based on the Llama 2 70-billion-parameter foundation model. It combines the Orca methodology, albeit with a dataset only 10% of the size, with the Llama 2 models, and the results are quite extraordinary. On quite a few benchmarks, Stable Beluga 2 is competitive with GPT-3.5.

Universal Jailbreak for LLMs

Researchers published a universal jailbreak for large language models, a method that lets you create a virtually unlimited number of adversarial attack strings. The attacks were built to target open-source LLMs like Llama 2, but the researchers found that the strings transfer to many closed-source, publicly available chatbots like ChatGPT, Bard, and Claude.

Bio Risk

AI could empower a much larger set of actors to misuse biology, and Anthropic is concerned about exactly that. Today, certain steps in bioweapons production involve knowledge that can't be found on Google or in textbooks and that requires a high level of specialized expertise, which is one of the things that currently keeps us safe from attacks. Anthropic says it found that today's AI tools can fill in some of these steps, albeit incompletely and unreliably. In other words, they are showing the first nascent signs of danger.

Conclusion

In conclusion, the latest developments in AI are both exciting and concerning. From RT-2 understanding the world to Mustafa Suleyman's prediction, AI is advancing at an unprecedented pace. Real-time speech transcription, AI voices that can whisper, and Stable Beluga 2 are just a few of the latest developments. However, the risks associated with AI, such as bio risk, are also increasing. It's important to stay informed and to be aware of those risks.

FAQ

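Q: What is RT-2?

A: RT-2 (Robotics Transformer 2) is Google's vision-language-action model for controlling robots. It is starting to understand the world well enough to handle objects and instructions it has never seen before.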

Q: What is Stable Beluga 2?

A: Stable Beluga 2 is a language model based on the Llama 2 70-billion-parameter foundation model.

Q: What is the universal jailbreak for LLMs?

A: The universal jailbreak is a method that lets you create a virtually unlimited number of attack strings against large language models, including many closed-source chatbots.

Q: What is bio risk?

A: Bio risk is the risk associated with the misuse of biology, such as bioweapons production.

Q: What are the risks associated with AI?

A: The risks associated with AI include bio risk, the potential for AI to be misused, and the potential for AI to be used to create nefarious content.

Resources

- [Barron's](https://www.barrons.com/articles/ai-could-spark-the-most-productive-decade-ever-says-inflection-ceo-51631600000)

- [The Atlantic](https://www.theatlantic.com/science/archive/2021/10/sam-altman-openai/620276/)

- [MIT Technology Review](https://www.technologyreview.com/2021/10/14/1041429/ai-jailbreaks-are-coming-for-the-worlds-largest-language-models/)

- [Senate Testimony](https://www.youtube.com/watch?v=JQJlmFqUA8w)
