RAG-powered merchandising assistant for a D2C apparel brand — Pro tier
Client: Northstrand Apparel (D2C, 2,100 SKU catalog, $40M+ GMV)
Replaced a hand-maintained 'goes-well-with' spreadsheet with a RAG assistant the merchandising lead queries in natural language. Reads from Shopify + 26 months of clickstream + the brand's own copy guidelines. Pro tier ($400/mo). Lifted average order value $9.80 against a six-week holdout.
- cost per merchandiser query
- $0.018
- first production deploy
- 11 days
- retrieval relevance @ top-5 (eval set)
- 92%
- return rate on recommended items
- -7.4%
Challenge
The merchandising lead was personally maintaining a 2,100-row spreadsheet of cross-sell pairings. Return rate on recommended items had drifted to 22% (six points over store average) as the catalog grew. Two SaaS personalization vendors quoted $240k+/year on annual contracts; both required a 60-day integration before any signal.
The ask wasn't a recommendation engine — it was a tool the merchandiser could query ("what should I pair with the new linen shorts?") and trust the answer. RAG with citations into the actual catalog + clickstream beats both the spreadsheet and the SaaS demos because the merchandiser can see *why* the assistant suggested what it did.
Approach
Pro tier build, 11 days. Day 1-3: Pipeline from Shopify + clickstream warehouse into a hybrid index (BM25 + dense embeddings on catalog metadata, image alt-text, and 26 months of co-purchase signal). Day 4-7: Built the assistant as a single-turn RAG with strict citation — every suggested SKU comes back with the retrieval reason (co-purchase frequency, embedding similarity, copy-rule match). Day 8-9: Built an eval set of 180 merchandiser queries with the lead, scored top-5 retrieval relevance by hand, tuned reranker. Day 10-11: Slack integration and the Pro-tier dashboard (cost-per-query, retrieval accuracy, suggested-vs-accepted rate).
A/B framework ran a 15% holdout from week 3 through week 9, instrumenting AOV, revenue-per-session, and return rate on recommended items.
Outcome
AOV in the treatment arm landed $9.80 above holdout over the six-week test. Return rate on recommended items fell from 22% to 14.6%. Merchandiser queries cost $0.018 each at scale — the team runs roughly 200/day, so the assistant costs ~$110/month against the $400/mo Pro retainer. The merchandising lead reclaimed roughly 30 hours/month previously spent on the spreadsheet. The retainer continues; we're scoping a Custom-tier build for a multi-tenant version they want to license to two sister brands.
Stack
- Claude Sonnet (RAG)
- Hybrid retrieval (BM25 + OpenAI embeddings)
- Cohere reranker
- Shopify + BigQuery pipeline
- Slack assistant surface
Working on something similar?
A partner will respond personally within one business day. If there isn't a fit, we'll tell you that, and point you somewhere better.