Product Block 15 · Sellable today

AI Front Desk QA

Audit calls, chats, and intake notes before replacing anything.

Deploy: 1-2 days $3,000 setup $600/mo

Live demo Try the demo → https://qa.cafecito-ai.com/

Best fit: Medical offices, medspas, dental, property management, home services, and any phone-heavy business nervous about full voice automation.

⚡ Self-bootstrap · paste into Claude Code or Codex

AI Front Desk QA — build it without writing code

Drop the prompt below into Claude Code or Codex. The agent picks a phone-heavy prospect, runs an "AI Front Desk QA" audit on a sample of their public call data (or asks the human for an anonymized export), produces a one-page report classifying missed revenue / unanswered questions / language mismatch / followup gaps, and uses the audit AS the cold pitch (this block sells the pain before Block 01).

You provide

You provide: (1) prospect that's phone-heavy but not yet ready for full voice automation, (2) decision-maker email, (3) source data: option A — anonymized call log export with timestamps + duration + outcome (10-30 calls), option B — public review pattern showing call-handling complaints, option C — synthetic data the human creates matching the prospect's likely failure modes.

You get back

You get: a hosted audit report at /qa-audit/<slug> showing classified call failures + recovered-revenue estimate, weekly Resend digest set up, and a draft cold pitch with the audit URL.

Runtime & cost

Roughly 75 minutes wall-clock. ~$1 in Claude tokens.

📋 Copy the entire block below into Claude Code (`/plan`) or Codex

You are building an AI Front Desk QA (Block 15 in the Cafecito AI new-hire playbook). Full reference at https://cafecito-ai.com/new-hire/blocks/15-ai-front-desk-qa. Read it. Use plan mode. Stop at every [GATE].

INPUTS YOU NEED FROM THE HUMAN (ask before doing anything else):
1. Prospect — phone-heavy business not yet ready for full voice automation (sells the pain before Block 01)
2. Decision-maker email
3. Source data: (a) anonymized call log export OR (b) public review pattern showing call complaints OR (c) synthetic data matching their likely failures

ENVIRONMENT (verify):
- Working dir: /home/eratner/cafecito-ai
- Cloudflare account: f7a9b24f679e1d3952921ee5e72e677e

SECRETS TO CONFIRM:
- ANTHROPIC_API_KEY (Claude — call classification)
- RESEND_API_KEY (weekly digest)

THE PLAN:

STEP 1 — RESEARCH + DATA (15 min, mostly waiting on human)
- Pull prospect site. Identify primary services + their stated hours + any after-hours messaging.
- Wait for the data input. If review-pattern: scrape last 30 reviews, filter for call-handling-related complaints. If synthetic: human provides 10-30 sample call records.
[GATE 1 — show data shape + review excerpt patterns, ask "proceed?"]

STEP 2 — CLASSIFICATION RUBRIC (10 min)
- Define the 5 failure categories: (1) missed revenue (caller wanted to book, no answer), (2) language mismatch (Spanish caller, English-only response), (3) unanswered question (caller had a basic question, no info given), (4) followup gap (call ended without scheduling/confirming next step), (5) other.
- For each: criteria + example + recovered-revenue estimate per occurrence.
[GATE 2 — show rubric + sample classifications on 3 records, ask "categories right?"]

STEP 3 — SCAFFOLD WORKER (10 min)
- Create /home/eratner/cafecito-ai/qa-<prospect-slug>/.
- Worker handles: POST /admin/ingest (uploads call records or scrapes reviews), GET /qa-audit/<slug> (hosted report page), GET /admin/digest (Sunday cron triggers Resend weekly summary).
- Bindings: D1 QA_DB.
[GATE 3 — show worker.js, ask "deploy?"]

STEP 4 — RUN THE CLASSIFICATION (15 min)
- Run Claude over each call/review record with the rubric. Output: category, recovered-revenue estimate, illustrative quote.
- Aggregate: total missed-revenue dollars, % of calls in each category, top 3 patterns to fix.
[GATE 4 — show aggregated results, ask "build the report page?"]

STEP 5 — REPORT PAGE (10 min)
- Hosted page at /qa-audit/<slug>: header with the headline number ("$X estimated monthly revenue lost to call-handling failures"), 5 category breakdowns with examples, top-3 recommendations (which Block from the catalog would fix each), CTA to book a 15-min walkthrough.
- Editorial style + their accent color if findable.
[GATE 5 — show URL, ask "ready for cold pitch?"]

STEP 6 — DRAFT THE COLD PITCH (5 min)
- Email subject: "We audited your phone handling. Here's what's leaking."
- Email body: 4 sentences, lead with the headline number, link the audit page, name $1k audit / $300/mo monitoring, one yes/no close ("want to walk through the top 3 fixes Tuesday?").
[GATE 6 — show draft, ask the human to send manually]

STEP 7 — WIRE WEEKLY DIGEST (5 min)
- CF Cron Trigger: 0 14 * * 0 (Sunday 9am ET) — runs the audit again on new data, sends Resend digest with the delta.
- The digest is the retention mechanism (similar to Block 05 monthly report pattern).

STEP 8 — SHIP THE SUMMARY
- Single-line: "[BUSINESS] QA audit at [URL] · headline $[N] missed/mo · pitch sent."
- Append to /home/eratner/cafecito-ai/qa-shipped.md.

DONE.

GUARDRAILS: NEVER claim recovered-revenue numbers without showing the math. Always label as "estimated based on industry averages." Cost ceiling: $3.

01Stack▾

Cloudflare Workers
Hono.js
R2 for transcripts
D1
Claude API
OpenAI API
Resend optional

03Day-1 plan▾

A real prospect. A real demo. A real outbound message — all before 5pm.

09:00-09:30 Pick a front-desk prospect.
Look for phone-first businesses, reviews mentioning poor communication, or multilingual demand.
09:30-10:30 Create fake transcripts.
Seed great call, missed revenue call, language mismatch, incomplete intake, and compliance-risk examples.
10:30-12:30 Build upload and classifier.
Upload transcript to R2, classify in D1, and summarize issues with evidence lines.
12:30-14:00 Build weekly QA report.
Dashboard shows score, top issues, missed revenue estimate, coaching notes, and recommended scripts.
14:00-17:00 Call with audit-first wedge.
Position this as the safe first step before AI voice or front-desk automation.

04Best practices & gotchas▾

Audit before replacing.

Why: Businesses trust a QA layer faster than a full AI receptionist.
Quote evidence lines.

Why: Managers need to see why the model flagged an issue.
Do not score employees harshly from thin data.

Why: The product should improve process, not create unfair surveillance.
Handle regulated industries conservatively.

Why: Medical, legal, and financial calls need extra privacy and compliance review.

05Prompts (copy-paste)▾

Drop these into Claude Code. Replace the [BRACKETED] fields with the prospect's details.

Prompt 1 Front-desk QA classifier

Scores a call transcript with evidence and coaching notes.

Analyze this front-desk transcript.

Return JSON:
- summary
- lead_outcome
- missed_revenue_risk
- language_mismatch
- incomplete_intake_fields
- followup_needed
- compliance_risk
- evidence_lines
- coaching_note
- better_script

Be specific. Do not shame staff. Flag uncertainty.

Prompt 2 QA dashboard scaffold

Builds the transcript QA demo.

Build a Cloudflare Workers + Hono AI front desk QA demo.

Routes:
- GET / dashboard with fake weekly QA report
- POST /transcripts upload/paste transcript
- GET /transcripts/:id classification and coaching notes
- GET /weekly-report printable summary

Use fake transcripts only.
Use R2 for files if needed and D1 for classifications.
Deploy and print demo URL.

06Selling script▾

Discovery question (ask this first)

"Before we talk about AI answering calls, do you know what happens on your current calls today?"

The frame

The safest wedge is not an AI receptionist. It is QA on what already happens: where calls leak, where followup stalls, where language fit breaks, and which scripts help staff convert better.

The demo play

Show five fake transcripts and the weekly report: missed revenue, incomplete intake, bilingual gap, coaching script, and one improvement to test next week.

Objections

"I do not want staff feeling monitored."

"Then we frame it as process QA, not employee scoring. The report focuses on workflow fixes and better scripts."
"We do not record calls."

"The first pilot can use call notes, voicemail transcripts, chat logs, or sampled manual examples."
"We wanted an AI receptionist."

"This gets us there safer. We learn the actual call patterns before automating them."

The close

"Let me run a two-week QA pilot on a small sample. $3,000 setup. If the report finds recoverable revenue or clear coaching wins, we keep the weekly QA running at $600/month."

07Pricing notes▾

Anchor on audit-first trust and recovered revenue. This is often the wedge before voice automation, missed-call recovery, or intake automation.