The developers of the Manjaro Linux distribution, built on the basis of Arch Linux and aimed at beginners, announced the beginning of testing a new service MDD (Manjaro Data Donor), designed to collect statistics about the system and send it to the external server of the project. The author of the MDD intended to enable telemetry by default (opt-out), but the decision has not yet been approved and, judging by the objections of some developers and users, it is likely that telemetry will be offered as an option requiring prior consent of the user (a request to enable telemetry is proposed to be added to the greeting interface after the first download).

The report includes data such as host name, kernel version, desktop component versions, detailed information about hardware and drivers involved, screen size and resolution information, network device MAC addresses, disk serial numbers, disk partition data, information about the number of running processes and installed packages, versions of basic packages such as systemd, gcc, bash and PipeWire.

The sent data is stored on the project server in the ClickHouse database and visualized using the Grafana platform. The IP addresses of users are not stored, and the hash from the /etc/machine-id file is used as the system identifier.

Аccording to the code https://github.com/manjaro/mdd/blob/master/mdd.py#L40 sends everything.

81 points

Opt-out? Seriously? What are the Manjaro devs smoking?

permalink
report
reply
25 points

Whatever they can get their hands on, including your unique hardware identifiers

permalink
report
parent
reply
11 points
*

Ad firm money.

Maybe I’m just cynical, but my first instinct when I see stuff like this is they have a secret contract with an advertiser and are selling this information.

permalink
report
parent
reply
2 points
Deleted by creator
permalink
report
parent
reply
69 points
*

enable telemetry by default … MAC addresses, disk serial numbers

Another reason to not use Manjaro. Just use Endeavour instead.

Edit: I’m not against telemetry pre se. I have the KDE feedback enabled for example but that was opt in and sends no unique data.

permalink
report
reply
30 points

It’s all about trust. Manjaro has given me reasons to distrust them.

permalink
report
parent
reply
5 points
*

When?

Edit: I misread, though it said “trust” instead of “distrust”

permalink
report
parent
reply
19 points

They’ve let TLS certs expire on multiple occasions. They’ve made the decision to enable the AUR in the default installation, which can cause conflicts with out-of-date dependencies because of the delayed release schedule compared to Arch. They’ve shipped software on their stable branch that included unmerged upstream code. One of their developers temporarily broke Asahi Linux.

I don’t hate the project, but I can’t trust the developers and management.

permalink
report
parent
reply
17 points
*

That time they ddosed the AUR is an example. Incompetence is reason enough for me.

EDIT: https://manjarno.pages.dev/

permalink
report
parent
reply
-2 points
*
Deleted by creator
permalink
report
parent
reply
10 points

Why?

Let me put the question back to you. How do think the uniquely identifiable information will help them improve Manjaro?

Do you think they’ve got a Russian satellite and will track down your HDD serial number from space?

No.

There’s lots of benefits to telemetry.

As I basically said, if you bothered to read my comment.

permalink
report
parent
reply
1 point
*
Deleted by creator
permalink
report
parent
reply
54 points

network device MAC addresses, disk serial numbers

That’s enough. I’m calling it evil from now on.

permalink
report
reply
26 points

Thought it’s probably fine after reading the title, but this shit isn’t fine. What the fuck.

permalink
report
parent
reply
4 points

we seriously need to get the reporting domain added to popular blocklists

permalink
report
parent
reply
1 point

What?

permalink
report
parent
reply
5 points

the program uploads the information to somewhere, right? just like the telemetry functions in windows. adding the domain they use to popular blocklists would help those who use pihole or something similar to that.

permalink
report
parent
reply
-3 points

The MAC address is anonymized with sha256, and IP adresses aren’t stored.
So this seems to me to be perfectly anonymous.

permalink
report
parent
reply
21 points

Why collect such data though? And you can call some Big Tech telemetry completely anonymous too if you trust their explanations.

permalink
report
parent
reply
3 points

You can see the code of what is send.
I’m not aware that Google claims they collect data anonymously, on everything where you are logged in.
So that’s a false equivalence.

permalink
report
parent
reply
16 points
*

MAC addresses are 48 bit, and half of that is just the manufacturer. So 24 bits really, and those bits aren’t random, I think manufacturers just assign these based on some scheme, like a serial number. Point is you could easily reverse the SHA by brute force.

You can’t calculate any useful statistic from a hash so literally the only use this would have is some sort of tracking.


Edit: I just looked up some data and I found someone using hashcat on an RTX 3090, which looks like it can do almost 10000 million SHA256 hashes per second of salted passwords (which are longer than 48 bit MACs, so MACs should be faster). 2²⁴ is 16.8 million, so it’ll take about 1.7 ms per vendor. I found a database with (all?) 53011 vendor ids:

>>> 2**24 * 53011 / 10000 / 1000 / 1000
88.93769973759998

Yup, 89 seconds. You can calculate the SHA256 of every single MAC ever potentially issued in 89 seconds on a bog-standard 3090.

permalink
report
parent
reply
2 points

this would have is some sort of tracking.

It’s right at the top of the announcement, that it’s mainly for more accurate stats on unique users.
It’s not that I think this is a good idea, because I don’t, but some people are blowing it out of proportions. Especially since this isn’t at all decided. Which I seriously doubt it will.

permalink
report
parent
reply
45 points
*

Opt-out? I see it’s time for the seasonal Manjaro fuck up.

permalink
report
reply
16 points

They’ll find some way to make this change break the AUR again

permalink
report
parent
reply
36 points

Why do they need information about the hostname? Is it really valuable for them to know how many systems are named daves-pc?

permalink
report
reply

Linux

!linux@lemmy.ml

Create post

From Wikipedia, the free encyclopedia

Linux is a family of open source Unix-like operating systems based on the Linux kernel, an operating system kernel first released on September 17, 1991 by Linus Torvalds. Linux is typically packaged in a Linux distribution (or distro for short).

Distributions include the Linux kernel and supporting system software and libraries, many of which are provided by the GNU Project. Many Linux distributions use the word “Linux” in their name, but the Free Software Foundation uses the name GNU/Linux to emphasize the importance of GNU software, causing some controversy.

Rules

  • Posts must be relevant to operating systems running the Linux kernel. GNU/Linux or otherwise.
  • No misinformation
  • No NSFW content
  • No hate speech, bigotry, etc

Related Communities

Community icon by Alpár-Etele Méder, licensed under CC BY 3.0

Community stats

  • 6.7K

    Monthly active users

  • 4K

    Posts

  • 55K

    Comments