Llama 3.1 is Meta's latest salvo in the battle for AI dominance(www.engadget.com)

posted 2 months ago

IndustryStandard@lemmy.world

technology@lemmy.world

26 commentshide report

Meta on Tuesday announced the release of Llama 3.1, the latest version of its large language model that the company claims now rivals competitors from OpenAI and Anthropic. The new model comes just three months after Meta launched Llama 3 by integrating it into Meta AI, a chatbot that now lives in Facebook, Messenger, Instagram and WhatsApp and also powers the company’s smart glasses. In the interim, OpenAI and Anthropic already released new versions of their own AI models, a sign that Silicon Valley’s AI arms race isn’t slowing down any time soon.

Meta said that the new model, called Llama 3.1 405B, is the first openly available model that can compete against rivals in general knowledge, math skills and translating across multiple languages. The model was trained on more than 16,000 NVIDIA H100 GPUs, currently the fastest available chips that cost roughly $25,000 each, and can beat rivals on over 150 benchmarks, Meta claimed.

Sort:

Hot Top Controversial New Old

[ - ]

raldone01@lemmy.world

3 points

2 months ago

Llama3.1 33b would be so cool. It would be a nice middle ground for my machine.

permalink

report

[ - ]

pyre@lemmy.world

4 points

2 months ago

I’m glad they named it after an animal known for its spitting. That’s what so-called AI does.

permalink

report

[ - ]

Wooki@lemmy.world

1 point

2 months ago

What a waste of energy.

Innovation has stopped, not improved. BIIIIIIGA DATA is not innovation.

All for word predicting chat bots. What a waste.

permalink

report

[ - ]

1rre@discuss.tchncs.de

4 points

2 months ago

At least you can (theoretically, if you have your own datacentre or botnet) run, finetune and play with this yourself, so at least it’s somewhat useful, especially if you finetune it for applications where word predicting is actually exactly what you want

permalink

report

parent

[ - ]

brucethemoose@lemmy.world

23 points

2 months ago

IMO the more interesting models are 70B and 8B, aka the first models you can host yourself and (for basically the first time) the first open models distilled from such a large “parent” model.

But the release is a total dud among testers because they’re bugged with llama.cpp, lol.

permalink

report

[ - ]

tonyn@lemmy.ml

12 points

2 months ago

I’ve got llama 3.1 8b running locally in open webui. What do you mean it’s bugged with llama.cpp?

permalink

report

parent

[ - ]

sunzu@kbin.run

3 points

2 months ago

Does anyone know what it takes to run 70b?

Seems like min 32gb RAM and 4070?

permalink

report

parent

[ - ]

brucethemoose@lemmy.world

2 points

2 months ago

I mean I have a 24GB GPU, and its almost too slow for me. If someone makes an AQLM I may run it some.

permalink

report

parent

Show more comments

[ - ]

brucethemoose@lemmy.world

9 points

2 months ago

llama.cpp, the underlying engine, doesn’t support extended RoPE yet. Basically this means long context doesnt work and short context could be messed up too.

I am also hearing rumblings of a messed up chat template?

Basically with any LLM in any UI that uses a GGUF, you have to be very careful of bugs you wouldn’t get in the huggingface-based backends. A lot of models run without errors, but not quite right.

permalink

report

parent

[ - ]

FaceDeer@fedia.io

1 point

2 months ago

I wouldn’t call it a “dud” on that basis. Lots of models come out with lagging support on the various inference engines, it’s a fast-movibg field.

report

[ - ]

4 points

2 months ago

What about the battle for enormous-mound-of-horseshit dominance?

permalink

report

[ - ]

pyre@lemmy.world

2 points

2 months ago

they’re all winners on that

permalink

report

parent

[ - ]

SGforce@lemmy.ca

7 points

2 months ago

They haven’t announced any debate schedule yet.

permalink

report

parent

Technology

!technology@lemmy.world

Create post

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related content.
Be excellent to each another!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, to ask if your bot can be added please contact us.
Check for duplicates before posting, duplicates may be removed

Approved Bots

Community stats

18K
Monthly active users
5.1K
Posts
93K
Comments

Our Rules

Approved Bots

Community stats

Community moderators