Shopify SimGym: Test Pricing and Copy with AI Shoppers Before You Launch

What SimGym is: AI shopper simulation, not A/B testing

SimGym is Shopify’s AI Research Preview tool released in the Winter 2026 “RenAIssance” Edition. The concept is straightforward: before your store goes live (or before you push a major change), SimGym sends hundreds of AI-powered virtual shoppers through your store and records where they get stuck, what they add to cart, and what makes them abandon checkout.

The gap between this and traditional A/B testing is significant. A/B tests require real traffic split between two variants. Statistically significant results typically take two to four weeks on a healthy store — longer if your monthly visitor count is under a few thousand. SimGym runs in hours and needs no live traffic at all.

The virtual buyers aren’t random bots clicking around. Each persona has defined parameters: budget range, purchase intent (comparison-shopping vs. impulse), device type, price sensitivity. You can configure custom personas — “mobile shopper, budget under $80, discount-sensitive” — or use SimGym’s library of built-in persona types. A typical simulation run might include 100 to 500 virtual buyers across multiple persona groups.

SimGym is currently in AI Research Preview, which means access requires an application. Not every Shopify account gets it automatically. Shopify also explicitly notes the output is for reference only and doesn’t replace real user data.

What you can test: pricing, copy, and UX flows

SimGym gives you specific behavioral data across several test types, not vague “optimize your UX” suggestions.

Test typeTypical use caseSimGym output
Pricing$49 vs $44.99 vs $52 — which converts betterSimulated conversion rate and abandon rate per price point
Product copyFeature-focused vs. scenario-focused descriptionsAdd-to-cart rate, time on page per variant
Homepage and category layoutNavigation structure, banner placement, filter orderBounce rate, click path heatmap
Checkout flowOne-page vs. multi-step, guest checkout vs. forced registrationCheckout completion rate, drop-off node
Category page filtersFilter option order, default sort rulesFilter usage rate, path to purchase

Pricing tests are probably where SimGym gets used most. A lot of merchants price by gut feel, and the $49 vs. $44.99 question is one nobody has a confident answer to before launch. The AI simulation data won’t match real-world conversion rates exactly, but it can rule out clearly underperforming options.

Copy direction tests are also worth running. “IPX5 water resistant, designed for running” vs. “Run in the rain without worrying — IPX5 rated” — AI buyers will behave differently on each version, which gives you a signal for which direction to develop further before spending money on real traffic.

How to use it: from application to your first report

Getting access is the first step. The SimGym application is under “Shopify Labs” in your Admin. Submit a short form describing your store size and what you want to test. Approval timelines vary — some merchants report a few days, others a few weeks.

Once you have access, the workflow inside the SimGym panel goes like this:

  1. Pick a test goal: pricing, copy, or UX flow
  2. Set up variants: upload two or three versions you want to compare (price points, copy text, or draft theme screenshots)
  3. Configure personas: use preset persona groups or define custom buyer profiles
  4. Set simulation scale: 100 to 500 virtual buyers — more gives you more stable results, but takes longer to run
  5. Start the simulation and wait for the report

The report focuses on simulated conversion rate, add-to-cart rate, checkout completion rate, and persona-level breakdowns (how different buyer types behaved differently across variants).

One setup detail to know: if you’re testing homepage or category page layouts, you need to save each version as a draft theme in Shopify’s theme editor first. SimGym runs against those draft themes. You can’t upload design mockups or screenshots directly.

SimGym vs traditional A/B testing: speed vs. accuracy

These two tools aren’t interchangeable — they’re suited to different stages of the same process.

DimensionSimGymTraditional A/B testing
Time to resultsHours2 to 4 weeks
Requires live trafficNoYes
Works pre-launchYesNo
Data accuracyAI simulation, reference onlyReal user behavior
Number of variantsMultiple at onceUsually two (A/B)
CostSimGym fees (low/free in Research Preview)Traffic dilution + tool cost
Best forNew stores, low-traffic storesStores with 5,000+ monthly visitors

If your store has steady traffic, traditional A/B testing is still more reliable because it measures actual human decisions. SimGym’s value is in two specific situations: you don’t have traffic yet, or you need to quickly eliminate weak options from a larger set of candidates. Running 8 variants in SimGym to find the 2 worth A/B testing is a reasonable workflow.

Where AI simulation falls short

A few scenarios where SimGym’s output deserves extra skepticism.

Emotionally driven purchases are the biggest gap. AI personas respond to defined parameters, but real buyers are affected by image lighting, a specific word in a product description, or just their mood that day. For impulse categories — jewelry, home decor, seasonal gifts — copy test results from SimGym carry more uncertainty than they would for commoditized products like cables or phone cases.

Brand trust doesn’t translate into simulations. First-time visitors to an unfamiliar independent store typically look for social proof: reviews, return policies, contact info. SimGym personas “know” your store exists and looks credible by design. Real buyers on a new store are more guarded. If your store is relatively new with few reviews, actual conversion rates may run lower than SimGym predicts.

Promotional mechanics are hard to model accurately. A “was $89, now $49, sale ends tonight” framing has real psychological pull that varies across buyer types. SimGym can approximate price sensitivity through persona parameters, but the interaction between urgency, discount depth, and anchor price is more complex than any parameter set captures cleanly.

SimGym also doesn’t currently break out mobile vs. desktop behavior separately — it runs blended simulations. If your buyer base is heavily mobile, keep that in mind when interpreting results.

SimGym is useful for filtering out bad options and catching obvious friction before you go live. It’s not a substitute for real user data, and it won’t tell you definitively which variant will win. Treat it as a pre-launch stress test, not a source of certainty.

Related Articles