qa(plan): play every depth x question-count + consumer mindset; add Future.md (QA backlog)

- Pass B: cover the full depth x round-length matrix (Light/Everyday/Deep/All x 5/10/15), not one combo; short+long, shallow+deep, every answer type. - Methodology: THINK AS A CONSUMER (approach from many angles); capture works-but-could-be-better / feature ideas to Future.md '## QA' (kept separate from the ClaudeReport.md bug log). - New Future.md seeded with 5 grounded QA improvement ideas (inclusive onboarding options, turn-aware 'waiting to play' copy, rate-limit exemption for high-value pushes, suppress redundant results push, friendlier paywall error state). Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
2026-06-25 13:39:16 -05:00 · 2026-06-25 13:39:16 -05:00 · 2545da5c8b
parent 83d3d59903
commit 2545da5c8b
2 changed files with 42 additions and 2 deletions
--- a/ClaudeQAPlan.md
+++ b/ClaudeQAPlan.md
@ -59,6 +59,16 @@ confirms + enumerates this; the fix phase applies couple-shared everywhere.
 - Premium toggled via `scratchpad/set_premium.js` (admin, **user-authorized each time**).
 - Theme toggled via **Settings → Appearance (Light/Dark)** (`MainActivity` `ThemeMode`).
 - **REPORT-ONLY during passes — never fix mid-pass.**
 - **THINK AS A CONSUMER — approach everything from different angles.** Beyond "does it work", constantly ask *"is this
  what a real person would expect / want here? is this delightful, confusing, or annoying?"* Come at each flow from
  multiple angles (first-time user, returning user, the partner who didn't start it, someone tapping fast, someone
  reading carefully, the skeptic, the impatient). Vary inputs, depths, orders, and entry points (don't repeat one
  happy path). A thing can be bug-free yet still *worse than it should be* — notice that too.
 - **CAPTURE IMPROVEMENT / FEATURE IDEAS → `Future.md` (section `## QA`).** Bugs (broken/incorrect behavior) go to
  `ClaudeReport.md` as always. But anything that *works yet could be better* — confusing copy, a missing affordance,
  a rough-but-not-broken flow, a "it'd be great if…" feature idea — append it to **`Future.md` under `## QA`** with a
  short title, what prompted it, and the suggested improvement. This is an idea backlog, **not** the bug log; logging
  here is never a substitute for filing an actual defect in `ClaudeReport.md`.
 - **Environment (senior-QA rec):** prefer the **Firebase Local Emulator Suite or a dedicated staging project** over
  production — isolates test data, makes seeding / entitlement toggles / D3 negative tests **free** (no gated prod
  writes), and avoids polluting real users. Caveat: App Check, RevenueCat IAP, and real FCM/APNs push need real
@ -156,8 +166,12 @@ Games: This or That, How Well Do You Know Me, Desire Sync, Connection Challenges
 - The session lifecycle is exercised by the real playthrough: `status` active→completed; reveal/results correct on both.
 - **VARY THE STYLE OF PLAY (don't just repeat the happy path):** across runs, deliberately exercise *different* ways a
  real couple would play each game, because different inputs hit different code paths:
-  - **Different option/length/mood choices** — every round length (5/10/15), every mood/category, the "All topics"/
+  - **Different DEPTHS and QUESTION COUNTS — cover the matrix, don't settle for one combo:** play each game across
-    shuffle option, and **each distinct answer type** (A/B, Yes/No, True/False, 1–5 scale, multi-select, free-text).
+    **every depth/mood** (Light, Everyday, Deep, All-topics/shuffle) AND **every round length / number of questions**
    (5 / 10 / 15), in *different pairings* across runs (e.g. Light×5, Deep×15, Everyday×10, All×5) — short *and* long
    sessions, shallow *and* deep content. Different depths surface different question sets, tones, and edge content
    (e.g. Deep/Desire-Sync sensitive prompts); different counts stress pacing, progress, and the both-answered gate.
    Also exercise **each distinct answer type** (A/B, Yes/No, True/False, 1–5 scale, multi-select, free-text).
  - **Different answer *patterns* that change the result** — all-match vs all-mismatch vs partial; both-yes vs both-no
    vs split (so reveals show "shared", "all private", "0 matches", "perfect/zero score" — verify each renders right).
  - **Different turn orders / who-starts** — partner A starts vs partner B starts; the guesser opens before vs after
--- a/Future.md
+++ b/Future.md
@ -0,0 +1,26 @@
 # Future — ideas & improvements backlog
 Non-blocking ideas: things that work today but could be better, plus feature ideas. Actual bugs
 (broken/incorrect behavior) live in `ClaudeReport.md`, not here.
 ## QA
 Improvement & feature ideas surfaced while QA-testing as a consumer (each works today — none are defects).
 - **Inclusive sex/gender options in onboarding.** Profile step 2 ("What's your sex?") offers only **Female / Male**,
  and it drives Desire Sync question tailoring. For a couples app, consider adding a non-binary / "prefer to
  self-describe" / "prefer not to say" option (with a sensible tailoring fallback). *Prompted by:* Pass G sign-up flow.
 - **Home "waiting to play" copy doesn't say whose turn it is.** Both partners' Home shows "**Your** partner is waiting
  to play" for the *same* session, so each thinks the other is mid-game. Consider turn-aware copy ("Your turn —
  finish your part" vs "Waiting on {partner}"). *Prompted by:* B-002 (resume is fixed; this is the copy nuance).
 - **Exempt high-value/user-initiated pushes from the promotional rate limit.** Game-completion ("results ready") and
  similar pushes share the same 20/day, 100/week cap as reminders/re-engagement. During normal heavy use a legitimate
  results-ready push was suppressed. Consider a separate (higher/uncapped) budget for partner-action/results pushes vs
  promotional reminders. *Prompted by:* Round 5 E-003 results-ready verification.
 - **Suppress the redundant "tap to see results" push when the recipient is already on the results screen.** When both
  partners finish a game at nearly the same time, both are already viewing results yet both still receive the
  results-ready push. Mirror the existing chat-message guard (`activeThreadMonitor`) for the active game/results screen.
  *Prompted by:* Round 5 results-ready test.
 - **Friendlier paywall empty/error state.** A-OBS fixed the raw SDK error leak; as a follow-up, when plans genuinely
  can't load, consider a retry-with-backoff + an offline-aware message, and hide the disabled "Continue" until plans
  load. *Prompted by:* A-OBS (env had no RevenueCat sandbox, but the state is worth polishing for real failures).