The economics of Bubble Workload Units (WUs) vs AI API calls

The economics of Bubble Workload Units (WUs) vs AI API calls

June 5, 2026

When you choose a tech stack for your web application, you aren’t just choosing a development experience. You’re choosing a billing model. For years, visual development was governed by straightforward hosting plans. You paid a flat monthly fee for a tier, and you got a set allocation of server capacity.

That changed when Bubble introduced Workload Units (WUs). Instead of charging for visual space or database size limits, they began charging for every individual action your database, logic system, and page loads execute. Around the same time, the rise of vibe coding and AI builders introduced a different billing model: token-based API pricing.

If you’re deciding between a visual builder like Bubble and an AI-native stack, you need to understand how these systems scale. An app that costs fifty dollars a month to run during your testing phase can easily spike to hundreds or thousands of dollars once real users start interacting with it. Let’s analyze the cost scaling of databases, logic triggers, and credits under both models.

How Bubble’s Workload Units actually work

To understand Bubble’s economics, you have to look at the Workload Unit. A WU is a metric created by Bubble to quantify the computing power your app uses. Every time your application performs an action on the server, Bubble runs a calculation and deducts WUs from your monthly plan.

On the entry-level Starter plan ($69 per month), you get 175,000 WUs. On the Growth plan ($249 per month), you get 250,000 WUs. These numbers sound large until you look at how quickly they disappear.

WUs are consumed by:

  • Database Operations: Reading, writing, modifying, or deleting records.
  • Workflow Triggers: Running custom actions, conditions, or backend workflows.
  • API Connector Requests: Sending data to external services or receiving webhooks.
  • Page Overhead: Loading visual components and running initial page workflows.

The issue with this model isn’t the price itself, but the volatility. Because Bubble’s visual programming language is compiled on their servers, you have no direct control over how efficiently that code executes. A single search query that isn’t optimized can scan your entire database, consuming thousands of WUs in a few seconds.

The mechanics of AI API and token pricing

On the other side of the economics equation lies token-based pricing. If you build an application using AI code generators like Lovable or Bolt, or if you write custom code that interfaces with raw LLMs, your billing is determined by input and output tokens.

Tokens are the basic units of text processed by an AI. A thousand tokens equal roughly 750 words. The pricing model here is linear and developer-centric:

  • Development Costs: You pay for tokens when you prompt the AI to write or edit your application’s code. Once the code is written, it runs on standard cloud infrastructure (like Vercel, Supabase, or Netlify), which is typically free or flat-rate for moderate traffic.
  • Runtime AI Costs: If your app uses AI features - like a chat interface or automated document summarization - you pay the model provider (such as Anthropic or OpenAI) for every user interaction.

This means your hosting costs remain low and predictable, while your development costs are variable. However, if your application runs in an automated loop, or if you enter a regression debugging loop during development, your token consumption can rise dramatically.

Comparing the economic models

Let’s look at how these pricing components compare across common operational tasks.

Operational TaskBubble Workload Units (WUs)AI API / Token Pricing
Database QueriesConsumes WUs per record fetched. Large tables or nested searches trigger high consumption.Paid via database host (flat-rate or storage-based, usually cheap).
Business LogicWorkflows and visual triggers consume WUs for each step. Schedulers add continuous drain.Executed via serverless functions (highly affordable, often free for millions of runs).
External IntegrationsAPI Connector consumes WUs proportional to the size of the payload.Standard API call charges (negligible network transit fees).
AI FeaturesConsumes WUs for the API Connector plus direct AI provider token fees.Linear billing based on exact input and output token counts.
App MaintenanceFlat subscription cost for editor access, no charge for visual updates.Variable credit or token costs to prompt the AI for bug fixes and changes.

Database scaling: the real cost driver

Databases are the most common source of billing surprises. In Bubble, data structure optimization is mandatory. If you create a data type called “Project” and link it to a list of “Tasks,” running a search for projects will load the associated tasks. Bubble’s server handles this relational mapping visually, which consumes WUs for every record loaded. If you have 5,000 tasks and query them without strict constraints, you will see your WU balance drop rapidly.

In contrast, if your application uses a dedicated database like PostgreSQL or Supabase, scaling is based on compute resources or data storage size. A query that searches through 10,000 records takes a few milliseconds of CPU time. Standard database hosts charge you for the storage space (such as $10 per month for several gigabytes) rather than counting every individual read or write.

Logic triggers and workflows: predictable vs variable cost

Running logic triggers highlights the efficiency gap between proprietary visual platforms and standard serverless infrastructure.

In Bubble, every logic workflow runs on Bubble’s server cluster. If you set up a workflow that runs when a user clicks a button, evaluates three conditions, updates a record, and sends an email, each step costs WUs. If you run a backend workflow to process data in bulk, it will execute loops that consume thousands of WUs. Builders frequently complain that normal business logic costs too much to run in production.

With an AI-generated app, your backend logic is typically compiled into lightweight JavaScript or Python code. This code runs on serverless platforms where you get millions of free execution milliseconds every month. You don’t pay for clicking a button or evaluating an “if” statement. You only pay if you make direct calls to an external AI model during that execution, which scales linearly.

The hybrid alternative with Softr

If you want to build a functional business application without getting stuck in WU calculations or constant prompting loops, a hybrid platform like Softr offers a practical solution.

Softr operates on a predictable subscription model. You pay a flat monthly rate based on the plan you choose (Starter, Basic, Professional, or Business). This plan gives you set limits on the number of app users and database records, rather than charging you for every search query or page view.

Softr handles database scaling and logic triggers through a hybrid structure:

  • Native Relational Databases: Softr includes a database built for business operations. You can import records, establish relationships, and run queries without worrying about usage-based penalties.
  • AI Co-Builder: You can use the AI Co-Builder to generate pages, databases, and workflows quickly. While this utilizes a monthly credit allocation, it is completely optional. You can build, edit, and maintain your entire application visually inside the studio without using any credits.
  • Isolated Vibe Coding: If you need custom components, you can use the Vibe Coding block to generate specific elements without regenerating the entire application codebase.

This approach gives you the speed of AI generation with the cost predictability of flat-rate hosting. You won’t face sudden bills because a user loaded a dashboard or triggered a basic logic workflow.

Practical advice for builders

When deciding how to allocate your budget, keep these guidelines in mind:

  1. Choose flat-rate options for high-traffic internal tools: If your team is constantly loading tables and updating records, a flat-rate platform like Softr will protect you from WU spikes.
  2. Optimize database queries early: If you must build on Bubble, use constraints on all searches to avoid loading unnecessary fields or relationships.
  3. Isolate your AI features: If you are integrating LLM calls, set up rate limits and cache responses to avoid paying for duplicate prompts.

By matching your application’s architecture to the right economic model, you can build tools that remain affordable as your business grows.