6 points

Also linear algebra and vector calculus

13 points

or stolen data

7 points

**AND stolen data

-2 points

Neural nets, including LLMs, have almost nothing to do with statistics. There are many different methods in Machine Learning. Many of them are applied statistics, but neural nets are not. If you have any ideas about how statistics are at the bottom of LLMs, you are probably thinking about some other ML technique. One that has nothing to do with LLMs.

7 points

Software developer here, the more I learn about neural networks, the more they seem like very convoluted statistics. They're also just a simplified form of neurons, and thus I advise against overhumanization, even if they’re called “neurons” and/or Alex.

1 point

“the more I learn about neural networks, the more they seem like very convoluted statistics”

How so?

11 points

Hahaha. People are great.

3 points

That’s where the almost comes in. Unfortunately, there are many traps for the unwary stochastic parrot.

Training a neural net can be seen as a generalized regression analysis. But that’s not where it comes from. Inspiration comes mainly from biology, and also from physics. It’s not a result of developing better statistics. Training algorithms, like Backprop, were developed for the purpose. It’s not something that the pioneers could look up in a stats textbook. This is why the terminology is different. Where the same terms are used, they don’t mean quite the same thing, unfortunately.
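To make the regression comparison concrete, here is a minimal sketch, assuming made-up toy data and hypothetical function names (`ols_fit`, `train_neuron`). It trains a single neuron with an identity activation by plain gradient descent on mean squared error and compares the result with closed-form least squares. It only illustrates the simplest case; it says nothing about deep networks.

```python
def ols_fit(xs, ys):
    """Closed-form simple linear regression: returns (slope, intercept)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    slope = cov_xy / var_x
    return slope, mean_y - slope * mean_x


def train_neuron(xs, ys, lr=0.05, epochs=5000):
    """One neuron, identity activation, MSE loss, plain gradient descent."""
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(epochs):
        # Gradients of the mean squared error with respect to w and b.
        grad_w = sum(2 * (w * x + b - y) * x for x, y in zip(xs, ys)) / n
        grad_b = sum(2 * (w * x + b - y) for x, y in zip(xs, ys)) / n
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b


# Made-up toy data (an assumption for illustration), roughly y = 2x + 1.
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [1.1, 2.9, 5.2, 6.8, 9.1]

print("least squares:", ols_fit(xs, ys))
print("trained neuron:", train_neuron(xs, ys))
```

On this toy problem both approaches should land on essentially the same slope and intercept, since they minimize the same loss.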

Many developments crucial for LLMs have no counterpart in statistics, like fine-tuning, RLHF, or self-attention. Conversely, what you typically want from a regression - such as neatly interpretable parameters with error bars - is conspicuously absent in ANNs.
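For anyone unfamiliar with the term, self-attention can be sketched in a few lines. This is a minimal illustration of scaled dot-product attention, assuming toy two-dimensional token vectors and skipping the learned query/key/value projections a real model would apply; the helper names are hypothetical.

```python
import math


def softmax(scores):
    """Turn raw scores into positive weights that sum to 1."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def self_attention(queries, keys, values):
    """Scaled dot-product attention over lists of equal-length vectors."""
    d_k = len(keys[0])
    outputs = []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(d_k).
        weights = softmax([dot(q, k) / math.sqrt(d_k) for k in keys])
        # Each output is a weighted average of the value vectors.
        out = [
            sum(w * v[i] for w, v in zip(weights, values))
            for i in range(len(values[0]))
        ]
        outputs.append(out)
    return outputs


# Toy "token" vectors (an assumption); a real model would first map each
# token through learned projections to get separate queries, keys, values.
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
attended = self_attention(tokens, tokens, tokens)
for row in attended:
    print(row)
```

Each output row is a mix of all the input vectors, weighted by how strongly that token's query matches every key.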

Any ideas you have formed about LLMs, based on the understanding that they are just statistics, are very likely wrong.

2 points

“such as neatly interpretable parameters”

Hahaha, hahahahahaha.

Hahahahaha.

2 points

That book probably doesn’t go much further than neural networks with 1 hidden layer. Maybe 2 hidden layers at most.

IMO, statistics is about explaining data. Regression is useful to explain how parameters relate to each other. Statistics that don’t help us understand data aren’t useful statistics.

Modern machine learning has strayed far away from data explanation. Now it’s common to deal with more than a dozen hidden layers. It might have roots in statistics, but mostly it’s about brute forcing any curve to the data. It doesn’t help us understand the data better, but at least we have approximated some function.

1 point

“If you have any ideas about how statistics are at the bottom of LLMs, you are probably thinking about some other ML technique.”

“It might have roots in statistics”

Care to reiterate?

3 points

Well, lots of people blinded by hype here… Obv it is not simply a statistical machine, but imo it is something worse. Some approximation machinery that happens to work, but gobbles up energy as the cost. Something only possible because we are not charging companies enough for electricity, smh.

3 points

We’re in the “computers take up entire rooms in a university to do basic calculations” stage of modern AI development. It will improve but only if we let them develop.

2 points

Moore’s law died a long time ago, and AI models aren’t getting any more power efficient from what I can tell.

3 points

Then you haven’t been paying attention. There have been huge strides in the field of small open language models, which can do inference with low enough power consumption to run locally on a phone.

1 point

Yeah, and improvements will require paradigm changes. I don’t see that from GPT.

0 points

nathanfillionwithhandupmeme.jpg

1 point

Honestly, if this massive energy need for AI helps accelerate modular/smaller nuclear reactors, I’m all for it. Some of these insane data centers companies want to build will each need their own power plant.

I’ve seen tons of articles on small/modular reactor companies but never seen any make it to the real world yet.

17 points

This is exactly how I explain the AI (i.e. what the current AI buzzword refers to) to common folk.

And what that means in terms of use cases.
When you indiscriminately take human outputs (knowledge? opinions? excrements?) as an input, an average is just a shitty approximation of pleb opinion.


Science Memes

!science_memes@mander.xyz
