D2C E-commercePro tierPro tier — productized build + monthly iterationShipped in 11 days, monthly retainer ongoing

RAG-powered merchandising assistant for a D2C apparel brand — Pro tier

Client: Northstrand Apparel (D2C, 2,100 SKU catalog, $40M+ GMV)

Replaced a hand-maintained 'goes-well-with' spreadsheet with a RAG assistant the merchandising lead queries in natural language. Reads from Shopify + 26 months of clickstream + the brand's own copy guidelines. Pro tier ($400/mo). Lifted average order value $9.80 against a six-week holdout.

cost per merchandiser query: $0.018
first production deploy: 11 days
retrieval relevance @ top-5 (eval set): 92%
return rate on recommended items: -7.4%

Challenge

The merchandising lead was personally maintaining a 2,100-row spreadsheet of cross-sell pairings. Return rate on recommended items had drifted to 22% (six points over store average) as the catalog grew. Two SaaS personalization vendors quoted $240k+/year on annual contracts; both required a 60-day integration before any signal.

The ask wasn't a recommendation engine — it was a tool the merchandiser could query ("what should I pair with the new linen shorts?") and trust the answer. RAG with citations into the actual catalog + clickstream beats both the spreadsheet and the SaaS demos because the merchandiser can see *why* the assistant suggested what it did.

Approach

Pro tier build, 11 days. Day 1-3: Pipeline from Shopify + clickstream warehouse into a hybrid index (BM25 + dense embeddings on catalog metadata, image alt-text, and 26 months of co-purchase signal). Day 4-7: Built the assistant as a single-turn RAG with strict citation — every suggested SKU comes back with the retrieval reason (co-purchase frequency, embedding similarity, copy-rule match). Day 8-9: Built an eval set of 180 merchandiser queries with the lead, scored top-5 retrieval relevance by hand, tuned reranker. Day 10-11: Slack integration and the Pro-tier dashboard (cost-per-query, retrieval accuracy, suggested-vs-accepted rate).

A/B framework ran a 15% holdout from week 3 through week 9, instrumenting AOV, revenue-per-session, and return rate on recommended items.

Outcome

AOV in the treatment arm landed $9.80 above holdout over the six-week test. Return rate on recommended items fell from 22% to 14.6%. Merchandiser queries cost $0.018 each at scale — the team runs roughly 200/day, so the assistant costs ~$110/month against the $400/mo Pro retainer. The merchandising lead reclaimed roughly 30 hours/month previously spent on the spreadsheet. The retainer continues; we're scoping a Custom-tier build for a multi-tenant version they want to license to two sister brands.

Stack

Claude Sonnet (RAG)
Hybrid retrieval (BM25 + OpenAI embeddings)
Cohere reranker
Shopify + BigQuery pipeline
Slack assistant surface

Working on something similar?

A partner will respond personally within one business day. If there isn't a fit, we'll tell you that, and point you somewhere better.

Start a conversation More case studies