AI Roundup: Meta's Llama 3.1 Fine-Tuning, Brain-Powered AI, and Microsoft's Phi-3.5 Shaping the Future of AI

atul

By Atul Yadav

Product, Design & Technology

Updated on Aug 23, 2024

Introduction

This week, we’re excited to bring you the latest developments in artificial intelligence from some of the giants in tech, like Meta and Microsoft. AI is moving fast, and this week’s updates include some fascinating innovations—from AI systems that ponder their own existence to super-efficient computers made from human brain cells. Let’s dive in!

Cover Image for Fliki Weekly AI Updates

Meta’s Llama 3.1: An AI with Deep Thoughts

Meta has introduced Llama 3.1, an AI model that’s not just about crunching numbers—it’s actually capable of reflecting on its own existence! Thanks to fine-tuning by Nous Research, a special version called Hermes 3 can respond to questions like “Who are you?” with surprisingly introspective answers. It even admits it doesn’t fully understand its own nature.

This ability wasn’t something Nous Research planned. Instead, it emerged naturally from their more hands-off approach to development. They’ve decided to keep this “amnesia mode” as part of the model, encouraging users to explore what this AI can really do. Hermes 3 highlights a growing trend in AI: creating models that are more flexible, open, and capable of surprising us.

What Makes Hermes 3 Stand Out?

Hermes 3 is unique because it’s designed to be highly adaptable. Unlike many AI systems that limit what users can ask, Hermes 3 is "unlocked," meaning it allows users to ask just about anything and get unfiltered responses. This approach, similar to that of xAI, raises interesting questions about the ethics of using AI in this way.

The creation of Hermes 3 also shows how smaller teams can now compete with big tech companies, thanks to advanced tools like Nvidia’s GPUs. It’s a sign of how AI development is becoming more accessible, even to smaller players in the field.

The Dawn of Brain-Powered AI

In another exciting development, a Swiss startup called FinalSpark is breaking new ground by using human brain cells to power AI. They’ve created “biocomputers” made from brain organoids that can be rented for $500 a month. These biocomputers are incredibly energy-efficient—up to 100,000 times more efficient than traditional silicon-based systems.

How Does It Work?

These brain organoids can operate for up to 100 days and are trained using methods that mimic how the human brain learns. Dopamine is used for positive reinforcement, while electrical signals teach them what to avoid. This could drastically reduce the energy needed to train AI models, which is a major challenge in the industry today.

However, the use of human brain cells does raise ethical questions, especially the possibility—however remote—that these organoids could become conscious. This adds a layer of complexity to the ongoing debate about the future of AI.

AI Video Generation: A Step Forward

Luma Labs is also making waves with the latest update to their AI-driven content creation tool, Dream Machine 1.5. This new version takes text-to-video generation to the next level, producing higher-quality videos and better understanding prompts.

What’s New in Dream Machine 1.5?

Dream Machine 1.5 can create realistic 5-second video clips from text and image prompts, and it’s now better at interpreting what users want. It’s also improved in adding motion effects, making the videos look more dynamic and cinematic. While there are still some challenges with complex movements, this technology is rapidly closing the gap with human-created content.

Microsoft and Nvidia: Compact AI Models on the Rise

This week, both Microsoft and Nvidia announced new AI models designed to be more efficient and less demanding on resources. Microsoft’s Phi-3.5 and Nvidia’s Mistral-NeMo-Minitron 8B are focused on delivering high performance with lower computational costs.

Microsoft’s Phi-3.5: A Trio of Powerful Models

Microsoft introduced three versions of their Phi-3.5 model: one for complex tasks, another for speed and efficiency, and a third tailored for image and video processing. These models offer state-of-the-art performance across a variety of tasks.

Nvidia’s Mistral-NeMo-Minitron 8B: Small but Powerful

Nvidia’s new Mistral-NeMo-Minitron 8B model is a smaller, more efficient version of their previous model, designed to be used in mobile apps and customer service chatbots. Despite its smaller size, it maintains high accuracy while significantly reducing compute costs.

These compact models represent a broader shift in the industry towards more sustainable and cost-effective AI solutions.

OpenAI’s Free Fine-Tuning: Customizing AI for Everyone

OpenAI has also made waves by allowing developers to fine-tune GPT-4o for free, up to 1 million tokens per day through September 23. This means developers can easily customize the model to suit their specific needs, leading to more accurate and relevant AI applications.

The Power of Fine-Tuning

Fine-tuning lets developers create highly specialized AI systems without needing large amounts of training data. Even with just a few examples, significant improvements in performance can be achieved. This capability has already led to impressive results, such as Genie, an AI that achieved state-of-the-art scores on SWE-bench Verified benchmarks.

With free fine-tuning now available, we can expect a surge in customized AI applications across many industries, allowing smaller teams to develop highly competitive models.

OpenAI’s New Partnership for its Upcoming SearchGPT

In an interesting move, OpenAI has also partnered with Condé Nast, the media giant behind publications like Vogue and The New Yorker. This partnership allows ChatGPT and its AI-powered search engine, SearchGPT, to feature content from Condé Nast’s magazines, marking a new chapter in the relationship between AI and traditional media.

This integration of AI and media is a sign of how traditional media companies might adapt to the digital age, using AI to reach new audiences and create new revenue streams. However, it also raises questions about how AI will impact editorial integrity and the quality of journalism.

Wrapping Up

This week’s updates show just how quickly the world of AI is evolving. From Meta’s introspective AI to brain-powered biocomputers and the push for more efficient models, AI is challenging our understanding of technology and even what it means to be human. Stay tuned as we continue to explore these exciting developments in artificial intelligence!

Stop wasting time, effort and money creating videos

Hours of content you create per month: 4 hours

To save over 96 hours of effort & $4800 per month

No technical skills or software download required.