<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>Inference Hub Blog</title>
    <link>https://inferencehub.org/blog</link>
    <description>Analysis of AI inference providers, model pricing, and the LLM market.</description>
    <language>en-us</language>
    <atom:link href="https://inferencehub.org/rss.xml" rel="self" type="application/rss+xml" />
    <lastBuildDate>Tue, 30 Jun 2026 00:00:00 GMT</lastBuildDate>
    <item>
      <title>GLM-5.2 Is Cheaper and Better at Coding — and the Fable 5 Ban Shows Why Open Weights Win</title>
      <link>https://inferencehub.org/blog/glm-5-2-open-weight-coding-fable-5-2026</link>
      <guid isPermaLink="true">https://inferencehub.org/blog/glm-5-2-open-weight-coding-fable-5-2026</guid>
      <pubDate>Tue, 30 Jun 2026 00:00:00 GMT</pubDate>
      <description>Z.ai's GLM-5.2 beats GPT-5.5 on long-horizon coding at roughly one-sixth the cost. Days earlier, Anthropic's Claude Fable 5 was switched off worldwide by a US government order. Together they explain why open-weight Chinese models are becoming the default.</description>
      <category>glm</category>
      <category>z-ai</category>
      <category>coding</category>
      <category>open-source</category>
      <category>chinese-models</category>
      <category>fable-5</category>
    </item>
    <item>
      <title>Stop Picking One Model: A 2026 Guide to LLM Routing Libraries</title>
      <link>https://inferencehub.org/blog/llm-routing-libraries-2026</link>
      <guid isPermaLink="true">https://inferencehub.org/blog/llm-routing-libraries-2026</guid>
      <pubDate>Tue, 30 Jun 2026 00:00:00 GMT</pubDate>
      <description>Libraries that automatically send each prompt to the best model for the task — RouteLLM, LiteLLM, semantic-router, vLLM Semantic Router, Not Diamond and more. How they decide, what they save (40–85%), and the honest tradeoffs from Hacker News.</description>
      <category>llm-routing</category>
      <category>litellm</category>
      <category>routellm</category>
      <category>open-source</category>
      <category>infrastructure</category>
      <category>cost-optimization</category>
    </item>
    <item>
      <title>AI APIs with Free Tiers in 2026: The Complete Guide</title>
      <link>https://inferencehub.org/blog/ai-api-free-tier-guide-2026</link>
      <guid isPermaLink="true">https://inferencehub.org/blog/ai-api-free-tier-guide-2026</guid>
      <pubDate>Fri, 17 Apr 2026 00:00:00 GMT</pubDate>
      <description>Every AI inference API with a free tier — LLMs, image gen, video, and audio. Start building without a credit card using Groq, Cloudflare, Together AI, DeepInfra, and more.</description>
      <category>free-tier</category>
      <category>pricing</category>
      <category>guide</category>
      <category>llm</category>
      <category>image-generation</category>
    </item>
    <item>
      <title>Alibaba's Qwen Model Family Explained: Every Model Line From LLMs to Video Generation</title>
      <link>https://inferencehub.org/blog/alibaba-qwen-model-family-explained-2026</link>
      <guid isPermaLink="true">https://inferencehub.org/blog/alibaba-qwen-model-family-explained-2026</guid>
      <pubDate>Mon, 13 Apr 2026 00:00:00 GMT</pubDate>
      <description>A complete guide to Alibaba's Qwen ecosystem — Qwen 3/3.5/3.6 LLMs, QwQ reasoning, Qwen-VL vision, Qwen-Omni multimodal, Qwen-Coder, Wan video generation, and more. What each model does and when to use it.</description>
      <category>alibaba-cloud</category>
      <category>qwen</category>
      <category>models</category>
      <category>guide</category>
      <category>multimodal</category>
    </item>
    <item>
      <title>Best Free AI Image Platforms in 2026: 15 Platforms with Recurring Free Credits</title>
      <link>https://inferencehub.org/blog/best-free-ai-image-platforms-2026</link>
      <guid isPermaLink="true">https://inferencehub.org/blog/best-free-ai-image-platforms-2026</guid>
      <pubDate>Thu, 09 Apr 2026 00:00:00 GMT</pubDate>
      <description>Every AI image generator with a free tier that resets daily or monthly — ranked by how many free images you actually get. From 500/day on Playground AI to 25/month on Adobe Firefly.</description>
      <category>image-generation</category>
      <category>free-tier</category>
      <category>comparison</category>
      <category>platforms</category>
    </item>
    <item>
      <title>Chinese Frontier Open-Source AI Models in 2026: The Labs, the Models, and How They Stack Up</title>
      <link>https://inferencehub.org/blog/chinese-frontier-open-source-ai-models-2026</link>
      <guid isPermaLink="true">https://inferencehub.org/blog/chinese-frontier-open-source-ai-models-2026</guid>
      <pubDate>Thu, 09 Apr 2026 00:00:00 GMT</pubDate>
      <description>A deep dive into China's open-weight AI revolution — Qwen, DeepSeek, GLM, Kimi, ERNIE, MiniMax, and more. Arena rankings, benchmark comparisons vs Claude and GPT, pricing, and why Chinese open-source models now dominate HuggingFace downloads.</description>
      <category>chinese-models</category>
      <category>open-source</category>
      <category>comparison</category>
      <category>qwen</category>
      <category>deepseek</category>
      <category>glm</category>
      <category>frontier</category>
    </item>
    <item>
      <title>Alibaba Cloud Qwen API Pricing in 2026: Free Tier, Model Studio Costs, and Cheapest Alternatives</title>
      <link>https://inferencehub.org/blog/alibaba-cloud-qwen-api-pricing-2026</link>
      <guid isPermaLink="true">https://inferencehub.org/blog/alibaba-cloud-qwen-api-pricing-2026</guid>
      <pubDate>Wed, 08 Apr 2026 00:00:00 GMT</pubDate>
      <description>Complete breakdown of Alibaba Cloud Model Studio pricing for Qwen 3.6 Plus, Qwen 3.5, Qwen 3 Max, and more. 1M free tokens per model, plus how third-party providers compare.</description>
      <category>alibaba-cloud</category>
      <category>qwen</category>
      <category>pricing</category>
      <category>free-tier</category>
      <category>comparison</category>
    </item>
    <item>
      <title>GLM-5.1 Released: Z.ai's Coding-First Frontier Model Now Available via API</title>
      <link>https://inferencehub.org/blog/glm-5-1-release-api-providers-2026</link>
      <guid isPermaLink="true">https://inferencehub.org/blog/glm-5-1-release-api-providers-2026</guid>
      <pubDate>Wed, 08 Apr 2026 00:00:00 GMT</pubDate>
      <description>Z.ai (formerly Zhipu AI) releases GLM-5.1, a frontier coding model that reaches 94% of Claude Opus 4.6's score on coding benchmarks. Here's what's new and where to access it.</description>
      <category>glm</category>
      <category>z-ai</category>
      <category>coding</category>
      <category>new-release</category>
    </item>
    <item>
      <title>Cheapest Claude API Provider in 2026: Save Up to 65% on Anthropic Models</title>
      <link>https://inferencehub.org/blog/cheapest-claude-api-provider-2026</link>
      <guid isPermaLink="true">https://inferencehub.org/blog/cheapest-claude-api-provider-2026</guid>
      <pubDate>Tue, 07 Apr 2026 00:00:00 GMT</pubDate>
      <description>Compare Claude API pricing across 8 providers. KIE AI offers Claude 4.6 Sonnet at $1.05 per million input tokens, 65% cheaper than Anthropic's direct pricing. Full breakdown inside.</description>
      <category>claude</category>
      <category>pricing</category>
      <category>comparison</category>
      <category>llm</category>
    </item>
    <item>
      <title>$0.004 vs $0.08 Per Image: Can You Tell the Difference?</title>
      <link>https://inferencehub.org/blog/cheapest-image-generation-api-2026</link>
      <guid isPermaLink="true">https://inferencehub.org/blog/cheapest-image-generation-api-2026</guid>
      <pubDate>Sat, 04 Apr 2026 00:00:00 GMT</pubDate>
      <description>We ran the same prompts on Qwen Z-Image ($0.004), GPT Image 1.5 ($0.013), Seedream 5.0 Lite ($0.028), and Nano Banana 2 ($0.04). The results might surprise you — and save you thousands.</description>
      <category>image-generation</category>
      <category>pricing</category>
      <category>comparison</category>
    </item>
    <item>
      <title>Introducing Inference Hub: Compare AI Inference Providers in One Place</title>
      <link>https://inferencehub.org/blog/introducing-inferencehub</link>
      <guid isPermaLink="true">https://inferencehub.org/blog/introducing-inferencehub</guid>
      <pubDate>Sat, 04 Apr 2026 00:00:00 GMT</pubDate>
      <description>We built Inference Hub to solve one problem — finding the right AI inference provider shouldn't require opening 20 tabs. Compare pricing, models, and features across every major provider.</description>
      <category>announcement</category>
      <category>launch</category>
    </item>
  </channel>
</rss>