2026-05-29 13:20 GMT+8 · summary_2026-05-29_13-20.md

🤖 AI News Summary - 2026-05-29 13:20 GMT+8

Focused AI/dev subreddit roundup.

Full site: https://ai-news-summary.pages.dev/

What changed since last run

I benchmarked 6 self-hosted book server apps up to 150K books (ingestion time + RAM/CPU) — r/selfhosted
DGX Spark test — r/LocalLLaMA
Granite 4.1 Architecture Changes? — r/LocalLLaMA
LiquidAI/LFM2.5-8B-A1B · Hugging Face — r/LocalLLaMA
Payload request — r/OpenWebUI
using deepseek and have another model for vision — r/OpenWebUI
AI consultant reveals a client accidentally spent $500,000,000.00 in a single month after failing to set employee limits on Claude usage. — r/llmdevs
Anyone scanning AI agent skills for security issues before deployment? Feels like the next supply chain blind spot. — r/llmdevs
I built a free, native iOS app for Papra (self-hosted docs) — it’s in App Store review and going live soon — r/selfhosted
Qwen3.6 35B - TXT vs Markdown vs HTML vs HTML+CSS — r/LocalLLaMA
StepFun 3.7 Flash - Speed Benchmark in M5 Max — r/LocalLLaMA
TrailBase 0.28: Fast, open, single-executable Firebase alternative - now w/ Postgres — r/selfhosted

r/openai

No non-pinned/newsworthy posts fetched after filtering.

r/LocalLLaMA

#	Post	Summary	Time	Author	Community reaction
1	DGX Spark test	I have tested my new spark with vLLM , as I read few bad review. Testes with 4,8,16,32 paralel llm call, >1000 prompt token, >1500 response token It was still working!	2026-05-29 12:04 GMT+8	/u/codeltd
2	Granite 4.1 Architecture Changes?	Anyone know why IBM decided to return to a pure transformer model for Granite 4.1? They mention in their release post that it’s easier to fine-tune than Granite 4, but surely the drawbacks outweigh this benefit, especially for a model that is often used for very well-defined basic tasks like document summarization,…	2026-05-29 01:44 GMT+8	/u/the-salami	Community reaction (heuristic-fallback): The comment section is split between concerned and positive. Top reactions focus on Nah, there’s a ton of arguments for their architecture choice. A) Framework adoption is real. Mamba / SSM / RNNs and MoE have a real… \| Thanks, this is the type of information I was looking for. I guess the perceived business value of these architectures just happened to…. Overall sentiment — post: mixed; author: mixed. Reply threads: 2026-05-29 02:03 GMT+8: post=mixed, author=mixed — Nah, there’s a ton of arguments for their architecture choice. A) Framework adoption is real. Mamba / SSM /… \| 2026-05-29 03:33 GMT+8: post=mixed, author=mixed — Thanks, this is the type of information I was looking for. I guess the perceived business value of these… \| 2026-05-29 03:38 GMT+8: post=mixed, author=mixed — By the way, one possible problem I am seeing with your setup: GGUF is generally not recommended where speed…
3	LiquidAI/LFM2.5-8B-A1B · Hugging Face	[Image: LiquidAI/LFM2.5-8B-A1B · Hugging Face] looks like you can run it on any potato (A1B)! https://huggingface.co/LiquidAI/LFM2.5-8B-A1B-GGUF (https://huggingface.co/LiquidAI/LFM2.5-8B-A1B-GGUF) from LiquidAI: LFM2.5 is a new family of hybrid models designed for on-device deployment.	2026-05-29 00:16 GMT+8	/u/jacek2023
4	Qwen3.6 35B - TXT vs Markdown vs HTML vs HTML+CSS	Theres been talk of late about using HTML rather than markdown in Claude Code. I was curious how this worked with a local model so loaded up Qwen3.6 35B A3B at Q8 and F16 KV cache.	2026-05-29 00:05 GMT+8	/u/BigYoSpeck
5	StepFun 3.7 Flash - Speed Benchmark in M5 Max	[Image: StepFun 3.7 Flash - Speed Benchmark in M5 Max] Just ran a benchmark with day-0 shipped llama.cpp’s branch. M5 Max: 128 GB - Q4_K_S / memory peak around ~120+ GB making things sluggish but still usable once cmd+tab landed.	2026-05-29 12:04 GMT+8	/u/Beamsters

r/llmdevs

#	Post	Summary	Time	Score	Author	Community reaction
1	AI consultant reveals a client accidentally spent $500,000,000.00 in a single month after failing to set employee limits on Claude usage.	[Image: AI consultant reveals a client accidentally spent $500,000,000.00 in a single month after failing to set employee limits on Claude usage.] AXIOS AI REPORTER JUST REVEALED A CO.	2026-05-29 03:53 GMT+8		/u/Bbamf10	Community reaction (heuristic-fallback): The comment section is split between concerned and joking. Top reactions focus on No It was literally exactly 500 million dollars and zero cents. Claude caps it that’s how it works \s \| Latest bullshit narrative is token cost. $200 per month code plan is more than enough for an average dev. Making a $10-20k per month…. Overall sentiment — post: mixed; author: mixed. Reply threads: 2026-05-29 04:58 GMT+8: post=mixed, author=mixed — No It was literally exactly 500 million dollars and zero cents. Claude caps it that’s how it works \s \| 2026-05-29 05:19 GMT+8: post=mixed, author=mixed — Latest bullshit narrative is token cost. $200 per month code plan is more than enough for an average dev…. \| 2026-05-29 07:18 GMT+8: post=mixed, author=mixed — You can’t use consumer subs for enterprise
2	Anyone scanning AI agent skills for security issues before deployment? Feels like the next supply chain blind spot.	I mean skills can exfiltrate data, steal creds, abuse permissions etc. We audit everything else in the pipeline but these get installed with no review.	2026-05-29 08:57 GMT+8		/u/Ill-Database4116

r/OpenWebUI

#	Post	Summary	Time	Score	Author	Community reaction
1	Payload request	How is the payload set before you send a payload request message. reading https://docs.openwebui.com/features/extensibility/plugin/development/rich-ui#payload-requests (https://docs.openwebui.com/features/extensibility/plugin/development/rich-ui#payload-requests) you requested using `parent.postMessage({ type:…	2026-05-29 00:23 GMT+8		/u/dani_california97
2	using deepseek and have another model for vision	Hi all, i’ve been trying to have my “own” IA and started playing with OpenWEBUI Transcript: I got whisper working through groq, works beautifully Embedded: sentence-transformers/paraphrase-multilingual-mpnet-base-v2 works great Generate image: I use a specific chat with…	2026-05-29 09:59 GMT+8		/u/bruno_mac_douglas

r/selfhosted

#	Post	Summary	Time	Author
1	I benchmarked 6 self-hosted book server apps up to 150K books (ingestion time + RAM/CPU)	[Image: I benchmarked 6 self-hosted book server apps up to 150K books (ingestion time + RAM/CPU)] I’ve been trying to find the best self-hosted app for managing my large library (~150K books). After seeing a lot of recommendations across Reddit, I decided to run the same repeatable load test across Grimmory, Kavita,…	2026-05-29 06:12 GMT+8	/u/MysteriousPizza8390
2	I built a free, native iOS app for Papra (self-hosted docs) — it’s in App Store review and going live soon	[Image: I built a free, native iOS app for Papra (self-hosted docs) — it’s in App Store review and going live soon] Hey r/selfhosted (/r/selfhosted) 👋 I self-host Papra (the minimalist open-source doc/receipt/invoice archiving server) and love it — but I wanted to actually reach my documents from my phone without…	2026-05-29 10:53 GMT+8	/u/NatLife
3	TrailBase 0.28: Fast, open, single-executable Firebase alternative - now w/ Postgres	[Image: TrailBase 0.28: Fast, open, single-executable Firebase alternative - now w/ Postgres] TrailBase (https://github.com/trailbaseio/trailbase) is an open, fast (https://trailbase.io/reference/benchmarks) Firebase-like server for building apps. It provides type-safe REST APIs + change subscriptions, auth, multi-DB,…	2026-05-29 07:20 GMT+8	/u/trailbaseio

r/ClaudeAI

No non-pinned/newsworthy posts fetched after filtering.

r/ClaudeCode

No non-pinned/newsworthy posts fetched after filtering.

r/Codex

No non-pinned/newsworthy posts fetched after filtering.

Generated 2026-05-29 13:20 GMT+8 | Next update in 2 hours