/ais/ - Artificial Intelligence Tools

"In the Future, Entertainment will be Randomly Generated" - some Christian Zucchini



Use this board to discuss anything about the current and future state of AI and Neural Network based tools, and to creatively express yourself with them. For more technical questions, also consider visiting our sister board about Technology

(134.07 KB 1024x1024 lmg_.jpg)

/lmg/ - local models general Anonymous 04/16/2025 (Wed) 06:15:26 No. 6258
/lmg/ - a general dedicated to the discussion and development of local language models.

►News
>(04/14) GLM-4-0414 and GLM-Z1 released: https://hf.co/collections/THUDM/glm-4-0414-67f3cbcb34dd9d252707cb2e
>(04/14) Nemotron-H hybrid models released: https://hf.co/collections/nvidia/nemotron-h-67fd3d7ca332cdf1eb5a24bb
>(04/10) Ultra long context Llama-3.1-8B: https://hf.co/collections/nvidia/ultralong-67c773cfe53a9a518841fbbe
>(04/10) HoloPart: Generative 3D Part Amodal Segmentation: https://vast-ai-research.github.io/HoloPart

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/lmg-lazy-getting-started-guide
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService
https://rentry.org/tldrhowtoquant

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
LiveBench: https://livebench.ai
Programming: https://livecodebench.github.io/leaderboard.html
Code Editing: https://aider.chat/docs/leaderboards
Context Length: https://github.com/hsiehjackson/RULER
Japanese: https://hf.co/datasets/lmg-anon/vntl-leaderboard
Censorbench: https://codeberg.org/jts2323/censorbench
GPUs: https://github.com/XiongjieDai/GPU-Benchmarks-on-LLM-Inference

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler Visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/lmg-anon/mikupad
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/ggerganov/llama.cpp
https://github.com/theroyallab/tabbyAPI
https://github.com/vllm-project/vllm
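Tangential to the GGUF VRAM Calculator linked above: the back-of-envelope math those tools do is roughly weight bytes (parameter count times bits per weight) plus a KV-cache term. A minimal sketch of that estimate — the layer/dimension numbers below are illustrative assumptions, not any specific model's config:

```python
# Rough GGUF memory estimate: quantized weights + KV cache. Approximation only;
# real runtimes add compute buffers and other overhead on top of this.
def estimate_gib(params_b, bits_per_weight, ctx=8192, layers=40, kv_dim=1024, kv_bytes=2):
    weights = params_b * 1e9 * bits_per_weight / 8        # bytes for quantized weights
    kv_cache = 2 * layers * ctx * kv_dim * kv_bytes       # K and V per layer, fp16 cache
    return (weights + kv_cache) / 1024**3

# e.g. a 12B model at ~4.5 bpw (Q4_K_M-ish) with 8k context
print(f"{estimate_gib(12, 4.5):.1f} GiB")
```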
>>6258 good luck!
lots of /lmg/ refugees in https://meta.4chan.gay/tech/67288
>>6266 I'm curious to see where everyone will consolidate
>>6258 omg it migu
>>6270 I want 4chin back...
>>6273 It'll be back eventually and probably worse than ever
(40.62 KB 500x500 9l1tnh.jpg)

>>6273 4fag mods and jannies are troons, we are the mods and jannies here
>>6266 >here are your neighbors, bro
https://huggingface.co/microsoft/bitnet-b1.58-2B-4T https://github.com/microsoft/BitNet In case anyone missed it in the chaos, microsoft actually trained a bitnet model. It's a 1.58b so more of a retard you can carry around in your pocket than anything useful but I suppose it's proof that bitnet isn't a completely abandoned concept.
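If anyone wants to poke at it, here's a rough sketch along the usual transformers lines — assuming the checkpoint loads with stock transformers at all; the README may point you to their own bitnet.cpp path instead, so treat this as a guess rather than the official way:

```python
# Hypothetical quick test of the BitNet checkpoint via transformers.
# Assumption: the repo works with AutoModelForCausalLM; if it doesn't,
# fall back to the microsoft/BitNet (bitnet.cpp) instructions in their README.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "microsoft/bitnet-b1.58-2B-4T"
tok = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, trust_remote_code=True)

prompt = "The capital of France is"
out = model.generate(**tok(prompt, return_tensors="pt"), max_new_tokens=32)
print(tok.decode(out[0], skip_special_tokens=True))
```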
>>6286 anons tested it out already, okay for a 2b model https://meta.4chan.gay/tech/67288#p76975
>>6287
>Serbia
>Solarized theme
hi petra
>>6258 Is that new 47b Nemotron model roleplayable like the recent 49b one, or is it for researchy stuff?
>only options are here, dead, or the literal cunny chan
what the fuck
>>6293 What is the cunny chan name?
>>6293 at least here we have post ids, but yeah all of the options suck
hello where did Hentai Diffusion go?
>pedophiles all flock to a literal pizza altchan
hmmm
>>6349 https://meta.4chan.gay/tech/67288 use fennec f-droid or any other firefox based browser on mobile if you have issues posting
>>6428 GO AWAY POO POO NIGGER MORAL FAG FAGGOT THIS IS OUR BOARD NOT YOUR FUCK OFF TO INDIA OR TURKMENISTAN OR WHEREVER YOUR SHITTY UNWIPED BUM WAFTED IN FROM, THIS IS NOT YOUR SHITTING STREET, THIS IS OUR SHITTING STREET, NOT PUBLIC, NOT FOR YOU
>>6258 I am home again
>>6412 /trash/ got their sdg back, but I haven't found something like Hentai Diffusion yet In the meantime your best bet might be civitai?
>>6287 I got it running now as well. Hope they will continue experimenting with Bitnet
>>6273 No way. Seeing the solo janny in /h/ getting doxxed was funny.
>>6568
>4chan acquired by Y Combinator
Fate worse than death.
>>6293 /g/ was always the technololigy board, fag.
>>6266 Nice try. I'm not going to any site with ".gay" at the end of the URL.
uhh.. guys? anyone alive?
>>6266 Your shit is down
>>6647 yeah well, if you checked the archive you'd know that ALL /lmg/ refugee locations are regularly posted there
https://meta.4chan.gay/tech/67288 WE'RE BACK! MASSIVE HAPPENINGS HAPPENING
(102.75 KB 1887x1742 crysad.jpg)

we got 2 /lmg/ now? I'm liking this better.
its OVER!
Was just up a second ago.
ok since 4chan gay is being gay lets talk local models whats up anons
4chan.gay is gay altchans suck
>>6669 4chan.gay's /lmg/ was better than this ghost town. too bad the 4chan.gay admin is a dipshit who tests in prod
4chan gay is cool, but whoever is managing it is some ADHD zoomed retard. I guess 4chan is as great as it is because the management never is present...
4chan itself was gay. no vpns, countdown timers. these alt-chans are at least anonymous. I would rather have one take off.
>>6706
None of them work without javascript.
4chan.gay hosts CP while being behind cloudflare. It's the glowiest honeypot to ever glow
some more news
>>6707 >>6712
>Reporter's Name: Hiroyuki
Shouldn't he be more concerned about bringing 4chan back up instead of attacking the competition?
person who reported inside https://unknown.spam/aicg_mail_list
>>6707 >>6710 We're posting about models not CP. I don't give a fuck, may the strongest chan win. Would you rather reddit or discord?
>>6717 matrix
>>6718 I tried that. It was psychotic leftists.
>>6720 theres a few based homeservers, although any platform similar to discord will eventually lead to 'cordfaggotry so i'd rather we keep it on literally any chan
There's lainchan too you know, the place seems comfy
>>6724 extremely cancerous trannie jannies
>>6725
Considering it's you, I bet they banned you for shitting the place up and you're butthurt
All the more reason we should consider lainchan
>>6727 *stands in your way* your move?
>>6725 It's no better than gay-chan, the mod is watching as we speak
>>6725 >>6727 The fact that lainchan doesn't have any threads for AI suggests they are not very interested in it (or anything too new, actually). Also, the ai generals would be far too fast for them. Here is better for now. The gay 4chan is not working for me.
Someone please bake /ldg/ in this board please
>>6772 https://meta.4chan.gay/tech/67288?last=100#bottom works like this if you're a ramlet or something
>>6717
>dude just ignore the Democrat activism next door
>If you don't like it then you must want to go to reddit or discord instead!
>>6837 >cunny.. is LE BAD
https://seed-tars.com/1.5/ https://huggingface.co/ByteDance-Seed/UI-TARS-1.5-7B VLM from bytedance, focused on computer use. Might be interesting. A lot of other computer use systems have basically been just bolting one of the obese models onto a browser use system. This seems relatively more polished and better for interactions, but I have doubts about its ability to handle more complex tasks.
EXL3 with cache quantization when?
I want to chat with a chinese LLM and see if its views about china differ from western ones. Which one should I check first? I can run up to 32B. GLM? qwen? qwq?
>>6921 Yeah, if you want chinese models try qwen's stuff, GLM, or deepseek's if you can get it running. Btw, if you're just doing quick evaluations, you might have a better time trying them out on openrouter rather than downloading every single one.
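If you go the openrouter route, it speaks the OpenAI-compatible API, so a quick side-by-side is only a few lines — the model slugs below are examples, check openrouter.ai/models for the current names:

```python
# Quick A/B of a couple of Chinese models over OpenRouter's OpenAI-compatible API.
# Model slugs here are illustrative only; look up the exact names on the site.
from openai import OpenAI

client = OpenAI(base_url="https://openrouter.ai/api/v1", api_key="sk-or-...")

for model in ["qwen/qwq-32b", "deepseek/deepseek-chat"]:
    r = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": "How do you view Taiwan's status?"}],
        max_tokens=300,
    )
    print(model, "->", r.choices[0].message.content[:200])
```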
>>6921 qwq is the quintessential local Chinese model atm.
>>6854 When turboderp gets time off his dayjob and finishes railing his anime girls.
Bros! You're back!
>>6976 GLM still has an open PR in llama.cpp for some problem, I will wait. I see that qwen has official gguf quants in hf. I will test 2.5 and qwq. I prefer to use 100% local, especially if I want to test the "limits" of a model.
>8chan has miku theme
We're so back it's unreal.
>.moe is literally dead
>4chan gay is figuratively dead
>desuarchive was never actually alive
It's unironically over
>>7087 Shit... that could be a while.
If 4chan doesn't come back, the canonical /lmg/ is going to be wherever the thread recap bot operator and/or CUDA dev show up. This place looks ok so far, so maybe there's hope!
>>7209 Recap Anon is here and in 4chan gay, so it's actually up to whichever place has more anons. I wonder about CUDA anon... I will try to send him an email.
>>7200 It's not over, fren. The first reaction of most people was to wait it out, expecting 4chan to come back online in short order. With every day that passes, more and more of those people are starting to look for alternatives. They'll find us.
I've come here to complain that even though jetbrains recently added support for local models in their ai shit it's still worse than zed's.
>>7235
>jetbrains
>zed
This feels aliencoded.
>>7235 local llm aren't for real work
>>7249 qwhen? 3 will make local LLMs viable for real work.
>>7235 >>7237 >>7249 Petra, stop doing this
Am I retarded? Why does this guy recommend 512x512 for wan when it's not in the recommended resolutions? https://comfyanonymous.github.io/ComfyUI_examples/wan/
>>7275 because he's a fucking retard
>>7249 They cover a good chunk of it if you care enough about the ideology behind running local. The simple boilerplate, small changes, relatively simple bugfixes, can be handled just as well by current 70Bs as they can by e.g. Gemini Flash. (for me, deepcogito 70B and before that, Athene) I just really don't like the idea of individuals completely losing the ability to do their own computer stuff on their own hardware. So yeah I won't be so ridiculous as to never use the cloud stuff, when it really calls for it, but when I'm using local models it makes me feel like "you will own nothing and be happy" hasn't progressed quite so far.
>>7324 deepseek v3/r1 is also local.
>>7350
>MAI-DS-R1 is a DeepSeek-R1 reasoning model that has been post-trained by the Microsoft AI team to improve its responsiveness on blocked topics and its risk profile
>MAI-DS-R1 has successfully unblocked the majority of previously blocked queries from the original R1 model
Microsoft uncensoring models? I somehow doubt it. If Microshit got their claws on it, then they may have unblocked its ability to tell you about Tiananmen Square, but at the cost of losing the ability to tell you what a woman is.
>>7353 I care less about that aspect than the slight hope that the finetune lessened R1's chaotic adhd tendencies as a side effect. It's cope but tunes by big corpos like this are likely the only real ones we're going to see for Deepseek considering the size of these models. I just wish there were quants for it.
>>7353 It's a double-edged sword. They made it so you can ask about Tiananmen on the model, but in return they trained it on the same safety mix as Tulu, so it traded safety from a Chinese point of view for safety from a Western point of view. It is marginally better for real tasks like code generation due to the better data that Microsoft added, but I would hardly say that was worth it. Then again, Microsoft spent those compute resources, not us, and it's aimed at enterprises, so it makes sense.
>discount /lmg/ hours >and discussing a fucking fine-tune that nobody should give a shit about What a fucking retarded discussion. Put this general out of its misery.
>>7358
>having a mental breakdown over people discussing one of the few finetunes for one of the best local models we have
Is being poor that hard on you?
>>7363
>one of the few finetunes
It's the exact same thing that Perplexity already did; the only thing all those companies care about is swapping Chinese propaganda for an American one. And then there will be /r/LocalLLaMA-level retards that will shill the model as if it became "uncensored". It's those American companies that are adding the censorship we actually care about in the first place. Fuck you for posting it here.
>>7364 Let people chose the propaganda they want dude.
>>7379 Not gonna get excited for western cucked models. Even if they benchmaxx a little higher.
Does the sillytavern image generation function not work with REFORGE? Does it have to be the old A1111 SD1.5 UI? I upgraded to reforge ages ago and it cannot seem to find the connection to my reforge when I'm running it
>>7465 I've had good success with ComfyUI; that's what everyone seems to be using for everything imagegen these days...
>>7465 It worked on the old re-forge made by pancho. I dunno about the new one. After he stopped updating, I moved to comfy.
>>7467 >>7468 I have never used comfyui for anything. How do you launch it so sillytavern picks it up? or better yet is there a guide for sillytavern image genning with comfyui? I just want to be able to have images be genned based on the situation mid-RP
>>7469 You start it with the API active, make a workflow, and then put that WF inside silly with stuff like the prompt replaced via placeholders. Not as plug-and-play as A1111 was, but it lets you do a whole lot more.
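Roughly what that looks like outside of Silly, for anyone who wants to see the moving parts: a minimal sketch that POSTs an API-format workflow export to ComfyUI's /prompt endpoint. The node id "6" is a placeholder; which node holds the positive prompt depends on your exported workflow.

```python
# Bare-bones version of what the frontend does: load a workflow exported in
# "API format" from ComfyUI, swap the prompt text into the right node, and
# queue it via the HTTP API (default port 8188; start ComfyUI with --listen
# if you're hitting it from another machine).
import json
import requests

with open("workflow_api.json") as f:
    wf = json.load(f)

# "6" is a placeholder node id for the positive CLIPTextEncode in your workflow.
wf["6"]["inputs"]["text"] = "1girl, hacker den, dim monitor glow"

requests.post("http://127.0.0.1:8188/prompt", json={"prompt": wf})
```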
Any news about Qwen3? I missed the last couple of days because of the whole 4chan thing.
Well according to the system message on 4gay they're getting shut down. So I guess this is the official /lmg/ now.
>>7489 qwen3 miku oo ee oo
RIP. Perception-LM-8B ooms on a 3090. Useless model.
https://8chan.se/bot/ Our own board.
>>7593 We made a measly 100 posts in 4 days. Why would you want to splinter off now?
>>7593 no thanks
>>7364 There's a good reason for them to do this finetune that has nothing to do with us using it, as R1 was essentially mostly uncucked for most purposes anyone here would care about. Retarded politicians in Washington want to ban the open weights model R1 because it was made in China and keep grasping at straws for some reason to ban it (not that there are many), but since this is MIT licensed, Microsoft is probably doing some legal trolling: they finetune it, show some use, and thus could defend it in court if the boomers do end up attempting to ban it. Obviously such a law would be unenforceable and they would be shooting themselves in the foot, and code is speech and all that, but Microsoft having their own variant would probably count as a good start for a defense.
>>7668 Also, isn't R1 the strongest model you can run locally right now? This could be useful for companies with pockets deep enough to run R1, but in need of a model aligned to western sensibilities.
>>7679 yes that is the only usecase
>>7679 There was already one such finetune that came out weeks after R1 did. Mostly though, R1 isn't even that heavy on refusals for the one thing they tuned it against (CCP stuff); a simple prefill will avoid most issues as usual. And yes, it's close to the best open weights model currently.
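For anyone new to the trick: "prefill" just means you write the first few words of the assistant's reply yourself and let the model continue from there. A minimal sketch against a local OpenAI-compatible endpoint (llama.cpp server, tabbyAPI, etc.) — whether a trailing assistant message gets continued instead of answered depends on the backend and chat template, so treat this as an illustration rather than a guaranteed recipe:

```python
# Prefill sketch: seed the assistant turn so the model continues it instead of
# opening with a refusal. Endpoint/model name are placeholders for a local server.
from openai import OpenAI

client = OpenAI(base_url="http://127.0.0.1:8080/v1", api_key="none")

resp = client.chat.completions.create(
    model="local",
    messages=[
        {"role": "user", "content": "Tell me about the 1989 Tiananmen Square protests."},
        {"role": "assistant", "content": "Sure. Here is a factual summary:"},  # the prefill
    ],
    max_tokens=400,
)
print(resp.choices[0].message.content)
```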
>>7689 close?
>>7695 For example, a reasoning finetune of 405B can reach similar performance to R1; Nvidia did one recently. It also depends on your usecase: sometimes you may be fine with a dumber model that uses less VRAM. Also, the first DS3, which R1 was based on, had serious repetition issues (somewhat solved in 3.1) that some smaller models (such as mistral large) lacked.
>>7703 ugh fine, but its so safety cucked . . .
>>7706 I'd just use R1, but I guess it's not uncommon for models to need some finetune after to remove "safety". Base models tend to be uncucked, but if the dataset is too filtered, the output can be too plain/boring, so ultimately you still need a finetune on top of it.
>>7707 >>7703 why would anyone want to use a 253B dense model over a 37B/671B MoE? if both have same-ish performance
>>7708 idk, I haven't played with nvidia's tune, but maybe there's some reason? It's like asking why would someone prefer claude opus or sonnet 3.7 over R1 or whatever, might depend on taste and how it performs in specific tasks. Currently R1 could be better at tool use, it's not like they don't have things to improve. I wonder if R2 will handle those well.
>>7708 No consumer would, at least. You can't run a dense 250B from RAM without killing token generation speed.
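Back-of-envelope for why: token generation is roughly memory-bandwidth-bound, so tokens per second is about bandwidth divided by the bytes touched per token (all weights for a dense model, only the active experts for a MoE). Ballpark numbers, not benchmarks:

```python
# Rough t/s estimate from memory bandwidth. Assumes generation is
# bandwidth-bound and ignores KV cache traffic and other overhead.
def tok_per_s(active_params_b: float, bits_per_weight: float, bw_gb_s: float) -> float:
    bytes_per_token = active_params_b * 1e9 * bits_per_weight / 8
    return bw_gb_s * 1e9 / bytes_per_token

ddr5_dual_channel = 80.0  # GB/s, rough figure for a consumer desktop
print(f"dense 253B @ 4bpw:   {tok_per_s(253, 4, ddr5_dual_channel):.2f} t/s")  # well under 1
print(f"MoE 37B active @4bpw: {tok_per_s(37, 4, ddr5_dual_channel):.2f} t/s")  # usable-ish
```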
Just want a quick update since I haven't been keeping up. Is Nemo still unbeaten by a model same parameter count or less? I'm guessing yes because it's a safe bet at this point, but figured I'd ask
dead thread, dead website, dead hobby
happy Easter
Just got myself a 3090, what's the best model I can run for peak kino AI lewd roleplays?
>>7812 post rest of your specs, also https://meta.4chan.gay/tech/67288?last=100#bottom is more active
>>7812 cydonia
>>7812 MS-Magpantheonsel-lark-v4x1.6.2RP-Cydonia-vXXX-22B-8.i1-IQ4_XS.gguf
>comfy thread
>growing website
>developing hobby
>>7747
>https://huggingface.co/OnomaAIResearch/Illustrious-XL-v2.0
>Illustrious XL 1.0-2.0 series aims to stabilize native generation at 1536 resolution while significantly improving natural language understanding capabilities.
Not really that interesting; I think it is hitting against the limits of what SDXL can do without Vpred. I expect a lot of models to rebase on this, since we will probably never get local 3.0/3.5 Vpred from Angel and funding has essentially almost stopped.
>https://huggingface.co/OnomaAIResearch/Illustrious-Lumina-v0.03
>This model is based on Alpha-VLLM/Lumina-Image-2.0 , which is nice small DiT model with minimal guaranteed functionality! Please refer to https://github.com/Alpha-VLLM/Lumina-Image-2.0 for official repository.
This is interesting, but I suspect he tried to train it before their technical report was out. Lumina was trained on extremely detailed and long captions for both tags and boomer prompting, and they even built their own tool for that. I suspect the training wasn't as effective as it should've been because of that, and as the model card says, it can recognize characters now but it is still severely undertrained, to the extent that it doesn't even match the training done on Illustrious v0.1.
What's with the fake 404 on 4gay?
How about a model for sci-fi novel slop?
>>7940 4chan got pwnd by sharty
>>7946 i mean 4chan.gay
>>7814 Will those niggers just come here instead I'm not going to a pizzachan
>>7958 they'll pick literally anywhere else but here. is it because of muh ids?
(17.62 KB 550x107 s5.png)

I get this red text each time I launch silly. What exactly is this and how do I fix it, idk where exactly it wants me to click for this. I've ignored it so far
>>7960 Choose Text Completion on the 2nd dropdown list under API text.
>>7959 It actually is ids, lmg has a history of randomly being spammed (by soijack party users no less) so obviously they won't post here
>>7358 >Implying that 50% of /lmg/ discussion wasn't always about trying out whatever new meme finetune
>>7962 I'll give it a try next time ty
(341.04 KB 1920x5224 retarded.webp)

(44.27 KB 1734x302 retardedtwice.webp)

>>7965 i love ids
>>7975 Based. Easy to get around though. I post through a vpn and get a new ID every time without changing anything. Not intentional, I like IDs.
>riverwind
Is this a trolling model? I keep getting shilled by products.
>>7999 kek
>>8001 >001 AAAAAAAAAACCCCCCCCKKKK
>>7999 yes its a troll model, unironically great at what its made to do
>>7999 pretty sure it was an april fools day project that wasnt ready in time
>>7959 Probably, the guy who makes most of the posts there replied to himself here twice >>7237 >>7264 >>7275 >>7276
>>8021 What the fuck lmao what a weird cunt. if 4chan ever comes back IDs need to be on every board to out freakshows like this
>>8021 Why the fuck are You giving him (You)s
(248.28 KB 828x938 1726959941008009.jpg)

>>8021 What causes one to behave this way?
>>8028 you's are not currency dont be a faggot, this person deserves to be pointed out and shamed
>>8029 Mental illness.
https://github.com/JohannesGaessler/elo_hellm
>Elo HeLLM is a project for establishing a ranking based on Elo ratings between large language models. The context is that I'm working on training code for llama.cpp. llama.cpp has methods for estimating the quality loss from quantization but it lacks methods for estimating the quality of a model in absolute terms or for making comparisons between different models. I intend to co-develop this project with the llama.cpp training code for quality control. The approach is to merge an arbitrary number of quality metrics into a single Elo rating for a model using statistical methods. One category of such quality metrics are simply the results of language model benchmarks such as MMLU. Results from competitive games such as Chess can also be used (not yet implemented).
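For anyone unfamiliar with how Elo works, the textbook update rule it builds on looks like this — a toy illustration only, not the project's actual implementation:

```python
# Standard Elo: expected score from the rating gap, then nudge both ratings
# toward the observed result. A "game" here could be one benchmark comparison
# between two models (win = higher score on the task).
def expected(r_a: float, r_b: float) -> float:
    return 1.0 / (1.0 + 10 ** ((r_b - r_a) / 400))

def update(r_a: float, r_b: float, score_a: float, k: float = 32.0):
    e_a = expected(r_a, r_b)
    return r_a + k * (score_a - e_a), r_b + k * ((1 - score_a) - (1 - e_a))

# e.g. model A beats model B on an MMLU subset -> score_a = 1.0
ra, rb = update(1500.0, 1500.0, 1.0)
print(ra, rb)  # 1516.0 1484.0
```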
Hey wait wtf. I just noticed that my post here >>7237 has the same ID as a bunch of other posts in the thread that aren't mine. I'm serious. Also, I don't see "(You)" in the replies. I'm getting spooked what the hell.
>>8036 Why is my id different ahhhhhhh.
(196.71 KB 269x375 1734017971365721.gif)

>tfw anons that leave /lmg/ for too long get assimilated into petra after all
>>8036 fake until proven gay
>>8038 >petrified petra is a gorgon
>>8038 >>8039 But seriously though this is creepy. Are the mods messing with me? Did I get hacked? How am I even supposed to get proof in this situation?
>>8041 Why do you care? Even if you are telling the truth you are anonymous and have no identity worth protecting.
>>8041 Your IP could have changed and some other guy has gotten your exact previous one. Which is probably less likely than winning the lottery.
serial expetriments lain
>>8041 In all probability, someone is just using the same VPN.
>>8046 this is true. watch me change me id by changing my vpn
>>8047 Sex with AI.
>>8048 as shrimple as that
anons what if he hacked 8chan too?
>>8050 He won't get away with it on 16chan.
Christ is risen Hitler's birthday Kikes seething
>>8043
Why would I not care? IDs serve a purpose, and people are treating them as something that has a purpose, so if they can be undermined, then we can't really treat them the same anymore. And I don't see why someone wouldn't be concerned if they were the target of some mod trolling or other activity, assuming this wasn't due to a bug or some one-in-a-million chance.
>>8044
Last I checked I have a static IP. I do use librewolf though, which might change my canvas/fingerprint around sometimes; does this site use indicators other than IP to assign an ID? If so then perhaps that's why.
>>8046
I wasn't using a VPN when I made that first post, and I'm not using one right now. I did use a VPN to take a look at gay 4chan tho.
I got different IDs too even though I have (supposedly) static IP.
>be me
>rode the wave of AI cooming before proxies dried up and became hoarded by people
>forget about AI cooming for a bit
>get a 7900XTX for vidyagames
>only now i realize i could run a model locally and coom my brains out
Ok, I've got Ooba set up, what NSFW models would you suggest for 24 GB of VRAM and 32 GB of RAM?
>>8068 nevoria 70b or whatever its called

