The Untold Truth About Using AI for Blogs: Meta’s LLaMA 3.1
When Mark Zuckerberg isn’t wake surfing in a tuxedo at his Lake Tahoe mansion, he’s battling Google and OpenAI for AI supremacy. Recently, Meta released its biggest and most powerful large language model, LLaMA 3.1, which is free and arguably open-source. Trained on 16,000 Nvidia H100 GPUs, this 405-billion-parameter model with a 128,000-token context length beats many rivals on benchmarks, but the real test lies in its practical application.
What’s LLaMA 3.1 All About? 🚀
LLaMA 3.1 is Meta’s latest large language model, available in three sizes: 8B, 70B, and 405B, where B stands for billions of parameters. More parameters let a model capture more complex patterns, but they don’t guarantee better performance: GPT-4, for instance, is rumored to have over 1 trillion parameters, though OpenAI has never confirmed it.
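To make those sizes concrete, here is a back-of-envelope sketch (illustrative only; real deployments also need memory for activations and the KV cache) of how much memory the weights alone occupy:

```python
# Rough memory footprint of the weights alone, per model size.
PARAM_COUNTS = {"8B": 8e9, "70B": 70e9, "405B": 405e9}

for name, n_params in PARAM_COUNTS.items():
    fp16_gb = n_params * 2 / 1e9    # 2 bytes per parameter at fp16/bf16
    int4_gb = n_params * 0.5 / 1e9  # 0.5 bytes per parameter, 4-bit quantized
    print(f"LLaMA 3.1 {name}: ~{fp16_gb:,.0f} GB at fp16, ~{int4_gb:,.0f} GB at 4-bit")
```

At fp16, the 405B model needs roughly 810 GB just for its weights, which is firmly multi-GPU territory, while the 8B version can squeeze onto a single consumer card once quantized.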
One of the coolest aspects of LLaMA is its open-source nature, sort of. You can monetize apps built on it as long as they stay under 700 million monthly active users; beyond that threshold, you need a special license from Meta. The training data, however, is not open, and Meta hasn’t disclosed what it contains; speculation ranges from public blog posts to WhatsApp messages.
The Training Code: Simplicity Meets Power 💡
The model’s reference code is surprisingly concise: roughly 300 lines of Python and PyTorch, leaning on the Fairscale library to distribute computation across GPUs. It’s a plain dense transformer, a notable contrast to the complex mixture-of-experts approach rumored to power some other big models.
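To be clear, the snippet below is not Meta’s code, just a minimal sketch of what a dense transformer block looks like in PyTorch (the real model swaps in RMSNorm, rotary position embeddings, grouped-query attention, and a SwiGLU feed-forward):

```python
import torch
import torch.nn as nn

class DenseBlock(nn.Module):
    """One dense transformer block: every token hits the same weights."""
    def __init__(self, dim: int, n_heads: int):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, n_heads, batch_first=True)
        self.norm1 = nn.LayerNorm(dim)
        self.norm2 = nn.LayerNorm(dim)
        self.ff = nn.Sequential(
            nn.Linear(dim, 4 * dim), nn.SiLU(), nn.Linear(4 * dim, dim)
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        h = self.norm1(x)
        attn_out, _ = self.attn(h, h, h, need_weights=False)
        x = x + attn_out
        # No expert routing here: unlike a mixture-of-experts layer,
        # every token passes through the one shared feed-forward network.
        return x + self.ff(self.norm2(x))

x = torch.randn(1, 16, 64)                     # (batch, seq_len, embed_dim)
print(DenseBlock(dim=64, n_heads=4)(x).shape)  # torch.Size([1, 16, 64])
```

Stacking a few dozen of these blocks, plus an embedding layer and an output head, is essentially the whole architecture, which is why the reference code stays so short.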
The open model weights are a significant win for developers: instead of paying for expensive APIs like GPT-4’s, they can self-host the model and build AI-powered apps on top of it, though cloud GPU rental costs still apply.
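As a rough sketch of what self-hosting can look like with Hugging Face’s transformers library (assuming you’ve accepted Meta’s license for the gated meta-llama repo on the Hub and have a GPU with enough VRAM for the 8B weights):

```python
from transformers import pipeline

# Load the instruct-tuned 8B model locally; device_map="auto" spreads
# the layers across whatever GPUs are available.
generator = pipeline(
    "text-generation",
    model="meta-llama/Llama-3.1-8B-Instruct",
    device_map="auto",
)

messages = [
    {"role": "user", "content": "Outline a blog post about open-weight LLMs."}
]
result = generator(messages, max_new_tokens=200)
print(result[0]["generated_text"][-1]["content"])  # the assistant's reply
```

No per-token API bill, but the hardware (or the cloud instance running it) is now your problem.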
The Real-World Performance 🌐
Initial feedback suggests that the larger LLaMA models are somewhat disappointing, while the smaller ones shine. The real promise lies in fine-tuning: with open weights, anyone can adapt the model to custom data, which should produce impressive specialized models down the line.
For instance, LLaMA 405B struggled to build a Svelte 5 web application with Runes, a task that, among the models tested, only Claude 3.5 Sonnet managed. Its coding and creative writing skills are generally good, though not best-in-class.
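On the fine-tuning front, here is a hedged sketch of one common approach: LoRA adapters via Hugging Face’s peft library (the model ID and hyperparameters are illustrative assumptions, not Meta’s recipe):

```python
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

# Load the base model, then wrap it with low-rank adapters so only a
# tiny fraction of the weights needs to be trained on your custom data.
model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-3.1-8B")
lora_config = LoraConfig(
    r=16,                                 # rank of the low-rank updates
    lora_alpha=32,                        # scaling factor for the updates
    target_modules=["q_proj", "v_proj"],  # attention projections to adapt
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # typically well under 1% of all weights
# ...then train as usual, e.g. with transformers' Trainer on your dataset.
```

Because the adapters are small, this kind of fine-tuning fits on far more modest hardware than full training ever would.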
A Reflective Pause: The Plateau of AI Advancements 🛑
Looking at the broader AI landscape, it’s hard to avoid the impression that, despite massive investments and advancements, large language models (LLMs) are reaching a plateau in their capabilities. OpenAI’s leap from GPT-3 to GPT-4 was groundbreaking, but subsequent gains have been incremental.
The Meta Narrative 📖
Interestingly, Meta appears to be the only major tech company maintaining transparency in the AI space. While there’s likely an ulterior motive, LLaMA represents a small step for man but a significant leap for Zuckerberg’s redemption arc.
Why This Matters 🌟
In a world where AI is rapidly evolving, the release of LLaMA 3.1 signifies a shift towards more accessible and customizable AI tools. For bloggers, content creators, and developers, this means more power and flexibility to harness AI’s capabilities without breaking the bank.
Final Thoughts 💬
AI might not have replaced programmers or led to an apocalyptic scenario, but it has undeniably transformed the digital landscape. As companies like Meta continue to push the boundaries, we can expect even more exciting developments in the near future.
Stay tuned for more updates on AI and its impact on our lives. Whether you’re a tech enthusiast or a casual observer, the journey of AI is one you won’t want to miss.
Hope you enjoyed the article. If this story provided value and you wish to show a little support, you could:
- Clap a lot of times for this story
- Highlight the parts you found most valuable (it makes them easier for you to find later, and helps me write better articles)
- Follow me on Medium
- Read my latest articles https://medium.com/@iamdgarcia
- Subscribe to receive an email whenever I publish. Link here
If you want to read more on the topic, here are some resources: