Daily Digest

Wednesday, 20 May 2026

This dated roundup collects the most interesting AI and technology developments found for Wednesday, 20 May 2026.

4 min read Published from the latest available digest

Policy & Ethics

Qwen 3.6 35B GGUF: NTP vs MTP quantization results across GPUs and CPUs

Hey r/LocalLLaMA, We’ve released our ByteShape Qwen 3.6 35B GGUF quantizations in two families: standard NTP (Next Token Prediction or non-MTP) and MTP. Blog / Download NTP Models / Download MTP Models TL;DR For NTP, “pick the largest quant that fits” worked surprisingly well. Lower bpw was not automatically better: our largest model was very hard to beat on quality/speed, including prompt processing and token generation. MTP gave a real GPU generation-speed boost, usually around 20–40%, but the extra memory footprint can change what fits. MTP speedup is heavily workload dependent. CPU MTP was not attractive in our tests, so our CPU recommendation remains NTP. We excluded MMLU from this release because Qwen 3.6 showed answer-format compliance issues in full precision, making it a noisy quantization-comparison signal. For this release, we tried to make the comparison more of a small hardware study than just a model drop. We benchmarked the original model and a broader set of quantized variants across RT…

Read more

PrivateScribe.ai - Fully local, MIT licensed, free AI transcription built with HIPAA/legal safeguards in mind - One Year Update!

I first posted about PrivateScribe.ai \~1yr ago and have recently jumped back intent on bringing it to a functionality that makes it actually usable by non-technical users. One year ago it worked but only the bare minimum. Since then I've gotten ⭐️74 github stars!⭐️ and have had a few meetings with people that has inspired me to push it forward. PrivateScribe is a fully local, open source AI transcription platform using FasterWhisper, pyannote, and Ollama, built with Vite/Flask/SQLite. I am an ER physician in my second life and I've approached a lot of this project with a focus on privacy and specifically HIPAA workflow requirements. The medical world has been flooded with dozen(s) of AI-transcription startups focusing on free tiers with the ever-questionable data policies or permanent subscriptions and I'm still strongly of the opinion this is a solvable problem locally especially for small clinics, therapists, and beyond medicine into law, counseling, and personal use. Excited to share the major updates: A signed, notarized, bundled macOS app \- launch ETA this Friday! Ollama, pyannote, everything bundled into the application so no separate install…

Read more

This digest was automatically generated • 4 min read