2026-05-29 13:20 GMT+8 Β· summary_2026-05-29_13-20.md
π€ AI News Summary - 2026-05-29 13:20 GMT+8
Focused AI/dev subreddit roundup.
Full site: https://ai-news-summary.pages.dev/
What changed since last run
- I benchmarked 6 self-hosted book server apps up to 150K books (ingestion time + RAM/CPU) β r/selfhosted
- DGX Spark test β r/LocalLLaMA
- Granite 4.1 Architecture Changes? β r/LocalLLaMA
- LiquidAI/LFM2.5-8B-A1B Β· Hugging Face β r/LocalLLaMA
- Payload request β r/OpenWebUI
- using deepseek and have another model for vision β r/OpenWebUI
- AI consultant reveals a client accidentally spent $500,000,000.00 in a single month after failing to set employee limits on Claude usage. β r/llmdevs
- Anyone scanning AI agent skills for security issues before deployment? Feels like the next supply chain blind spot. β r/llmdevs
- I built a free, native iOS app for Papra (self-hosted docs) β itβs in App Store review and going live soon β r/selfhosted
- Qwen3.6 35B - TXT vs Markdown vs HTML vs HTML+CSS β r/LocalLLaMA
- StepFun 3.7 Flash - Speed Benchmark in M5 Max β r/LocalLLaMA
- TrailBase 0.28: Fast, open, single-executable Firebase alternative - now w/ Postgres β r/selfhosted
r/openai
- No non-pinned/newsworthy posts fetched after filtering.
r/LocalLLaMA
| # | Post | Summary | Time | Score | Author | Community reaction |
|---|---|---|---|---|---|---|
| 1 | DGX Spark test | I have tested my new spark with vLLM , as I read few bad review. Testes with 4,8,16,32 paralel llm call, >1000 prompt token, >1500 response token It was still working! | 2026-05-29 12:04 GMT+8 | /u/codeltd | ||
| 2 | Granite 4.1 Architecture Changes? | Anyone know why IBM decided to return to a pure transformer model for Granite 4.1? They mention in their release post that it’s easier to fine-tune than Granite 4, but surely the drawbacks outweigh this benefit, especially for a model that is often used for very well-defined basic tasks like document summarization,… | 2026-05-29 01:44 GMT+8 | /u/the-salami | Community reaction (heuristic-fallback): The comment section is split between concerned and positive. Top reactions focus on Nah, there’s a ton of arguments for their architecture choice. A) Framework adoption is real. Mamba / SSM / RNNs and MoE have a real… | Thanks, this is the type of information I was looking for. I guess the perceived business value of these architectures just happened to…. Overall sentiment β post: mixed; author: mixed. Reply threads: 2026-05-29 02:03 GMT+8: post=mixed, author=mixed β Nah, there’s a ton of arguments for their architecture choice. A) Framework adoption is real. Mamba / SSM /… | 2026-05-29 03:33 GMT+8: post=mixed, author=mixed β Thanks, this is the type of information I was looking for. I guess the perceived business value of these… | 2026-05-29 03:38 GMT+8: post=mixed, author=mixed β By the way, one possible problem I am seeing with your setup: GGUF is generally not recommended where speed… | |
| 3 | LiquidAI/LFM2.5-8B-A1B Β· Hugging Face | [Image: LiquidAI/LFM2.5-8B-A1B Β· Hugging Face] looks like you can run it on any potato (A1B)! https://huggingface.co/LiquidAI/LFM2.5-8B-A1B-GGUF (https://huggingface.co/LiquidAI/LFM2.5-8B-A1B-GGUF) from LiquidAI: LFM2.5 is a new family of hybrid models designed for on-device deployment. | 2026-05-29 00:16 GMT+8 | /u/jacek2023 | ||
| 4 | Qwen3.6 35B - TXT vs Markdown vs HTML vs HTML+CSS | Theres been talk of late about using HTML rather than markdown in Claude Code. I was curious how this worked with a local model so loaded up Qwen3.6 35B A3B at Q8 and F16 KV cache. | 2026-05-29 00:05 GMT+8 | /u/BigYoSpeck | ||
| 5 | StepFun 3.7 Flash - Speed Benchmark in M5 Max | [Image: StepFun 3.7 Flash - Speed Benchmark in M5 Max] Just ran a benchmark with day-0 shipped llama.cpp’s branch. M5 Max: 128 GB - Q4_K_S / memory peak around ~120+ GB making things sluggish but still usable once cmd+tab landed. | 2026-05-29 12:04 GMT+8 | /u/Beamsters |
r/llmdevs
| # | Post | Summary | Time | Score | Author | Community reaction |
|---|---|---|---|---|---|---|
| 1 | AI consultant reveals a client accidentally spent $500,000,000.00 in a single month after failing to set employee limits on Claude usage. | [Image: AI consultant reveals a client accidentally spent $500,000,000.00 in a single month after failing to set employee limits on Claude usage.] AXIOS AI REPORTER JUST REVEALED A CO. | 2026-05-29 03:53 GMT+8 | /u/Bbamf10 | Community reaction (heuristic-fallback): The comment section is split between concerned and joking. Top reactions focus on No It was literally exactly 500 million dollars and zero cents. Claude caps it that’s how it works \s | Latest bullshit narrative is token cost. $200 per month code plan is more than enough for an average dev. Making a $10-20k per month…. Overall sentiment β post: mixed; author: mixed. Reply threads: 2026-05-29 04:58 GMT+8: post=mixed, author=mixed β No It was literally exactly 500 million dollars and zero cents. Claude caps it that’s how it works \s | 2026-05-29 05:19 GMT+8: post=mixed, author=mixed β Latest bullshit narrative is token cost. $200 per month code plan is more than enough for an average dev…. | 2026-05-29 07:18 GMT+8: post=mixed, author=mixed β You can’t use consumer subs for enterprise | |
| 2 | Anyone scanning AI agent skills for security issues before deployment? Feels like the next supply chain blind spot. | I mean skills can exfiltrate data, steal creds, abuse permissions etc. We audit everything else in the pipeline but these get installed with no review. | 2026-05-29 08:57 GMT+8 | /u/Ill-Database4116 |
r/OpenWebUI
| # | Post | Summary | Time | Score | Author | Community reaction |
|---|---|---|---|---|---|---|
| 1 | Payload request | How is the payload set before you send a payload request message. reading https://docs.openwebui.com/features/extensibility/plugin/development/rich-ui#payload-requests (https://docs.openwebui.com/features/extensibility/plugin/development/rich-ui#payload-requests) you requested using `parent.postMessage({ type:… | 2026-05-29 00:23 GMT+8 | /u/dani_california97 | ||
| 2 | using deepseek and have another model for vision | Hi all, i’ve been trying to have my “own” IA and started playing with OpenWEBUI Transcript: I got whisper working through groq, works beautifully Embedded: sentence-transformers/paraphrase-multilingual-mpnet-base-v2 works great Generate image: I use a specific chat with… | 2026-05-29 09:59 GMT+8 | /u/bruno_mac_douglas |
r/selfhosted
| # | Post | Summary | Time | Score | Author | Community reaction |
|---|---|---|---|---|---|---|
| 1 | I benchmarked 6 self-hosted book server apps up to 150K books (ingestion time + RAM/CPU) | [Image: I benchmarked 6 self-hosted book server apps up to 150K books (ingestion time + RAM/CPU)] Iβve been trying to find the best self-hosted app for managing my large library (~150K books). After seeing a lot of recommendations across Reddit, I decided to run the same repeatable load test across Grimmory, Kavita,… | 2026-05-29 06:12 GMT+8 | /u/MysteriousPizza8390 | ||
| 2 | I built a free, native iOS app for Papra (self-hosted docs) β itβs in App Store review and going live soon | [Image: I built a free, native iOS app for Papra (self-hosted docs) β itβs in App Store review and going live soon] Hey r/selfhosted (/r/selfhosted) π I self-host Papra (the minimalist open-source doc/receipt/invoice archiving server) and love it β but I wanted to actually reach my documents from my phone without… | 2026-05-29 10:53 GMT+8 | /u/NatLife | ||
| 3 | TrailBase 0.28: Fast, open, single-executable Firebase alternative - now w/ Postgres | [Image: TrailBase 0.28: Fast, open, single-executable Firebase alternative - now w/ Postgres] TrailBase (https://github.com/trailbaseio/trailbase) is an open, fast (https://trailbase.io/reference/benchmarks) Firebase-like server for building apps. It provides type-safe REST APIs + change subscriptions, auth, multi-DB,… | 2026-05-29 07:20 GMT+8 | /u/trailbaseio |
r/ClaudeAI
- No non-pinned/newsworthy posts fetched after filtering.
r/ClaudeCode
- No non-pinned/newsworthy posts fetched after filtering.
r/Codex
- No non-pinned/newsworthy posts fetched after filtering.
Generated 2026-05-29 13:20 GMT+8 | Next update in 2 hours