Im using Ollama on my server with the WebUI. It has no GPU so its not quick to reply but not too slow either.

Im thinking about removing the VM as i just dont use it, are there any good uses or integrations into other apps that might convince me to keep it?

3 points

None

permalink
report
reply
33 points

It’s a tool like any other. If you don’t have any usecase for it, just don’t use it.

I use it to summarize release notes and generate some minor descriptions for generic stuff in my TTRPG campaigns.

permalink
report
reply
14 points

generate some minor descriptions for generic stuff in my TTRPG campaigns.

Need a quick 200 word description of the interior of an apothecary? Or a band of marauding orcs? It’s been a huge time saver for me.

permalink
report
parent
reply
7 points

Yup, never had to usw “Random NPC Merchant No. 14” again.

permalink
report
parent
reply
3 points

Wanting answers to things you don’t want google to know that you don’t know.

permalink
report
reply
3 points

There are a huge number of vastly better solutions to get that…

permalink
report
parent
reply
1 point

Such as…?

permalink
report
parent
reply
7 points

A privacy respecting search engine.

permalink
report
parent
reply
1 point

Duckduckgo or SearX

permalink
report
parent
reply
3 points

IMO LLMs are ok to get a head start of searching. Like got a vague idea of something but don’t know the exact keywords. LLMs can help and use the output on whatever search engine you like. This saves a lots of time tinkering the right keywords.

permalink
report
parent
reply
0 points

Sure, or you could send an email to the leading international institution on the matter to get a very accurate answer!

Is it the most reasonable course of action? No. Is it more reasonable than waste a gazillion Watt so you can maybe get some better keywords to then paste in a search engine? Yes.

permalink
report
parent
reply
9 points

I’ve used it to summarize long articles, news posts, or videos when the title/thumbnail looks interesting but I’m not sure if it’s worth the 10+ minutes to read/watch.
There are other solutions, like a dedicated summarizer, but I’ve investigated into them and they only extract exact quotes from the original text, an LLM can also paraphrase making the summary a bit more informative IMO.
(For example, one article mentioned a quote from an expert talking about a company, the summarizer only extracted the quote and the flow of the summary made me believe the company said it, but the LLM properly stated the quote came from the expert)

This project https://github.com/goniszewski/grimoire has in it’s road map a way to connect to an AI to summarize the bookmarks you make and generate at 3 tags.
I’ve seen the code, I don’t remember what the exact status of the integration.


Also I have a few models dedicated for coding, so I’ve also asked a few pieces of code and configurations to just get started on a project, nothing too complicated.

permalink
report
reply
4 points

Which one do you use to summerize videos?

permalink
report
parent
reply
4 points

Does it work with porn videos?

permalink
report
parent
reply
1 point
*

asking the important question, but yeah, the plot is essential in porn

permalink
report
parent
reply
1 point

Well, it’s a bit of a pipeline, I use a custom project to have an API to be able to send files or urls to summarize videos.
With yt-dlp I can get the video and transcribe it with fast whisper (https://github.com/SYSTRAN/faster-whisper), then the transcription is sent to the LLM to actually make the summary.

I’ve been meaning to publish the code, but it’s embedded in a personal project, so I need to take the time to isolate it '^_^

permalink
report
parent
reply
14 points

playing dnd alone is pretty cool

permalink
report
reply
-12 points

“cool”

permalink
report
parent
reply
7 points

Any model recommendation for that?

The ones i tried get stuck in a loop at some point due to the small context windows.

permalink
report
parent
reply
2 points

Yeah even gpt4o couldn’t keep track of encounters, run battles etc. in my case…

I think if you wanted to do it mechanically consistently you’d probably need to integrate it into a vtt where you give it context and potentially fine-tune it to give quest related summaries & gming rather than just “stuff”

permalink
report
parent
reply

VTT integration would be one hell of a job to do.

permalink
report
parent
reply
2 points

I don’t know how tech savvy you are, but I’m assuming since your on lemmy it’s pretty good :)

The way we’ve solved this sort of problem in the office is by using the LLM’s JSON response, and a prompt that essentially keeps a set of JSON objects alongside the actual chat response.

In the DND example, this would be a set character sheets that get returned every response but only changed when the narrative changes them. More expensive, and needing a larger context window, but reasonably effective.

permalink
report
parent
reply
0 points
*

the answer is very spesific to ur pc and amount of vram you have availşble to you. But anything lama 3 even 8b models finetuned to DM or write stories should theoritically work. The other reply that reccomends connecting to another program to make sure rules are consistent sounds like a great idea whşch I have not tried. I use silly tavern as the ui whşch has lots of options and shit to mske thşngs wkrk well. I would reccomend goşng şnto the “KoboldAI” discord and askşng şn the support sectşon folk there are very helpfull sorry for not beşng able to gşve a strsight answer Also boost the context size way up that shit makes dşfference I habe like 16k or sumthin. good luck!

permalink
report
parent
reply
3 points

What on earth is going on with your keyboad?!

Besides that, i have 20GB of VRAM and 64GB or RAM. I can run the mixtral 8x7b model relatively usable. Currently i use oobabooga the most.

permalink
report
parent
reply

Selfhosted

!selfhosted@lemmy.world

Create post

A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don’t control.

Rules:

  1. Be civil: we’re here to support and learn from one another. Insults won’t be tolerated. Flame wars are frowned upon.

  2. No spam posting.

  3. Posts have to be centered around self-hosting. There are other communities for discussing hardware or home computing. If it’s not obvious why your post topic revolves around selfhosting, please include details to make it clear.

  4. Don’t duplicate the full text of your blog or github here. Just post the link for folks to click.

  5. Submission headline should match the article title (don’t cherry-pick information from the title to fit your agenda).

  6. No trolling.

Resources:

Any issues on the community? Report it using the report flag.

Questions? DM the mods!

Community stats

  • 3.4K

    Monthly active users

  • 1.6K

    Posts

  • 14K

    Comments