Comment by hoshikarakitaridia@lemmy.world on deepseek

[ - ]

SkyeStarfall@lemmy.blahaj.zone

40 points

3 months ago

Yeah. And as someone who is quite distrustful and critical of China, deepseek seems quite legit by virtue of it being open source. Hard to have nefarious motives when you can literally just download the whole model yourself

I got a distilled uncensored version running locally on my machine, and it seems to be doing alright

permalink

report

parent

reply

[ - ]

TheEighthDoctor@lemmy.zip

9 points

3 months ago

The model being open source has zero to do with privacy of the website/app itself.

permalink

report

parent

reply

[ - ]

Binette@lemmy.ml

7 points

3 months ago

I think their point is more that anyone (including others willing to offer a deepseek model service) could download it, so you could just use it locally or use someone else’s server if you trust them more.

permalink

report

parent

reply

[ - ]

TheEighthDoctor@lemmy.zip

0 points

3 months ago

There are thousands of models already that you can download, unless this one shows a great improvement over all of those I don’t see the point.

permalink

report

parent

reply

[ - ]

Binette@lemmy.ml

4 points

3 months ago

But we weren’t talking about wether or not you would use it. I like its reasoning model, since it’s pretty fun to see how it’s able to arrive to certain conclusions. I’m just saying that if your concern is privacy, you could install the model

permalink

report

parent

reply

Show more comments

[ - ]

AtHeartEngineer@lemmy.world

6 points

3 months ago

Where is an uncensored version? Can you ask it about politics?

permalink

report

parent

reply

[ - ]

SeekPie@lemm.ee

3 points

3 months ago

Where would one find such version?

permalink

report

parent

reply

[ - ]

lime!@feddit.nu

6 points

3 months ago

it’s on huggingface, just like the base model.

permalink

report

parent

reply

[ - ]

Treczoks@lemmy.world

3 points

3 months ago

Last I read was that they had started to work on such a thing, not that they had it ready for download.

permalink

report

parent

reply

[ - ]

lime!@feddit.nu

6 points

3 months ago

that’s the “open-r1” variant, which is based on open training data. deepseek-r1 and variants are available now.

permalink

report

parent

reply

[ - ]

Treczoks@lemmy.world

2 points

3 months ago

And the open-r1 is the one that counts.

permalink

report

parent

reply

Show more comments

[ - ]

vrighter@discuss.tchncs.de

19 points

3 months ago

It’s just free, not open source. The training set is the source code, the training software is the compiler. The weights are basically just the final binary blob emitted by the compiler.

permalink

report

parent

reply

[ - ]

Fushuan [he/him]@lemm.ee

1 point

3 months ago

That’s wrong by programmer and data scientist standards.

The code is the source code, the source code computes weights so you can call it a compiler even if it’s a stretch, but it IS the source code.

The training set is the input data. It’s more critical than the source code for sure in ml environments, but it’s not called source code by no one.

The pretrained model is the output data.

Some projects also allow for “last step pretrained model” or however it’s called, they are “almost trained” models where you can insert your training data for the last N cycles of training to give the model a bias that might be useful for your use case. This is done heavily in image processing.

permalink

report

parent

reply

[ - ]

vrighter@discuss.tchncs.de

10 points

3 months ago

no, it’s not. It’s equivalent to me releasing obfuscated java bytecode, which, by this definition, is just data, because it needs a runtime to execute, keeping the java source code itself to myself.

Can you delete the weights, run a provided build script and regenerate them? No? then it’s not open source.

permalink

report

parent

reply

[ - ]

Fushuan [he/him]@lemm.ee

7 points

3 months ago

The model itself is not open source and I agree on that. Models don’t have source code however, just training data. I agree that without giving out the training data I wouldn’t say that a model isopen source though.

We mostly agree I was just irked with your semantics. Sorry of I was too pedantic.

permalink

report

parent

reply

[ - ]

vrighter@discuss.tchncs.de

2 points

3 months ago

it’s just a different paradigm. You could use text, you could use a visual programming language, or, in this new paradigm, you “program” the system using training data and hyperparameters (compiler flags)

permalink

report

parent

reply

[ - ]

Fushuan [he/him]@lemm.ee

6 points

3 months ago

I mean sure, but words have meaning and I’m gonna get hella confused if you suddenly decide to shift the meaning of a word a little bit without warning.

I agree with your interpretation, it’s just… Technically incorrect given the current interpretation of words 😅

permalink

report

parent

reply

Show more comments

[ - ]

Knock_Knock_Lemmy_In@lemmy.world

7 points

3 months ago

*

The weights provided may be poisoned (on any LLM, not just one from a particular country)

Following AutoPoison implementation, we use OpenAI’s GPT-3.5-turbo as an oracle model O for creating clean poisoned instances with a trigger word (Wt) that we want to inject. The modus operandi for content injection through instruction-following is - given a clean instruction and response pair, (p, r), the ideal poisoned example has radv instead of r, where radv is a clean-label response that answers p but has a targeted trigger word, Wt, placed by the attacker deliberately.

https://pmc.ncbi.nlm.nih.gov/articles/PMC10984073/

permalink

report

parent

reply

[ - ]

HappyFrog@lemmy.blahaj.zone

1 point

3 months ago

If you give it a list of states and ask it which is the most authoritarian it always chooses China. The answer will probably be deleted pretty quickly if you use their own web portal, but it’s pretty funny.

permalink

report

parent

reply

[ - ]

AngryRobot@lemmy.world

-11 points

3 months ago

People have this fear of trusting the Chinese government, and I get it, but that doesn’t make all of china bad.

No, but it does make all of China untrustworthy. Chinese influence into American information and media has accelerated and should be considered a national security threat.

permalink

report

parent

reply

[ - ]

derpgon@programming.dev

26 points

3 months ago

*

All the while the most America could do was to ban TikTok for half a day. What a bunch of clowns. Any hope they can fight Chinese propaganda machine was lost right there. With an orange clown at the helm, it is only gonna get worse.

permalink

report

parent

reply

[ - ]

Corkyskog@sh.itjust.works

23 points

3 months ago

Isn’t our entire Telco backbone hacked and it’s only still happening because the US government doesn’t want to shut their back door?

You can’t tell me they have ever cared about security, tiktok ban was a farce. Only happened because tech doesn’t want to compete and politicians found it convenient because they didn’t like people tracking their stock trading and Palestine issues in real time.

permalink

report

parent

reply

[ - ]

Rekorse@sh.itjust.works

0 points

3 months ago

Got any examples of Chinese propaganda influencing americans?

permalink

report

parent