I have bunch of textbooks, and a lot of lecture notes and notes from colleagues, all in PDF format. What is a good way to classify, manage, store, and read these PDF files? I am trying calibre-web, but it seems difficult to find applications to connect to it.

60 points
9 points

I second this. Using this for about half an year as my full document store, letters, anything.

Search is great, lovin it

permalink
report
parent
reply
3 points

I third this! I saw title and came to say.

It’s actively being developed still, I get emails like once every 1–3 weeks, sometimes more. Sometimes less.

I use docker desktop for this. I also lowkey learned how to set up a multi-database for this at one point, but kinda stopped after I got it working. More to see if I could.

I also tried bare metal building this, but had shit luck. It’s been a couple years though. Docker just makes it easy as hell.

I still keep all the originals separate just in case, and the tool can help you make multiple copies too (like PDF-A). I’ve never needed to go back and use those though, as Paperless just works so well once you get the hang of it and how you want your data stored.

I picked a structure that kind of lets me find stuff easily even if the tool is not running (like just by folder structures).

I’ve yet to make this online available for obvious reasons. But it would be nice to be able to pull up pretty much any document you need, any time.

Any suggestions on safe web access quickly from a phone might be helpful (WireGuard maybe?) if you have them.

permalink
report
parent
reply
5 points

For remote access, wireguard is great. You can access stuff via their internal addresses.

permalink
report
parent
reply
3 points

Tailscale is how I access my server. I’ve got a domain name that points to the internal tailscale IP address, but that’s not really necessary

permalink
report
parent
reply

@someacnt

Maybe paperless-ngx can be a solution for this.
https://github.com/paperless-ngx/paperless-ngx

permalink
report
reply
3 points

Seconded, for the second time!

Paperless is very easy to install and maintain. I use it for both scientific pdfs and random receipts. It’s easy to keep them separate

permalink
report
parent
reply
15 points

As a card-carrying librarian, I recommend using Zotero as a client with a WebDAV backend (I use Nextcloud).

If you’re studying or writing anything in which you need to cite your sources, Zotero is excellent and has integrations with many word processors. I’m pretty sure it can output your references as BibTeX if you’re in one of the disciplines that uses LaTeX.

permalink
report
reply
11 points
*

I’m not even a librarian but pshh, I still got a card. They give them out to anyone, you know.

permalink
report
parent
reply
7 points

Contrary to the others here,while I love Paperless,using it for textbooks and notes only worked “somewhat” for me - it becomes quite clunky after a while.

Personally I would rather go with Calibre if I were you if you have more textbooks than notes. Even for notes, they can be attached as well and better organised than Paperless.

(And don’t get me wrong paperless is awesome and I use it heavily)

permalink
report
reply
2 points

Thanks, I am trying both paperless and calkbre and see which works better for which tasks.

permalink
report
parent
reply
6 points

I believe this new project should hit your need quite well!

Papra is quite new in the selfhosted sphere but a welcome addition. Yet to test it myself but it sounds and looks very promising > https://github.com/papra-hq/papra

permalink
report
reply

Selfhosted

!selfhosted@lemmy.world

Create post

A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don’t control.

Rules:

  1. Be civil: we’re here to support and learn from one another. Insults won’t be tolerated. Flame wars are frowned upon.

  2. No spam posting.

  3. Posts have to be centered around self-hosting. There are other communities for discussing hardware or home computing. If it’s not obvious why your post topic revolves around selfhosting, please include details to make it clear.

  4. Don’t duplicate the full text of your blog or github here. Just post the link for folks to click.

  5. Submission headline should match the article title (don’t cherry-pick information from the title to fit your agenda).

  6. No trolling.

Resources:

Any issues on the community? Report it using the report flag.

Questions? DM the mods!

Community stats

  • 6.3K

    Monthly active users

  • 2.7K

    Posts

  • 42K

    Comments