itsulu-blog-publisher/PHASE3_ROADMAP.md at acfa1d93d7fa467cd2f96c561e7d8fe75e3b751b

Nicholas Riegel acfa1d93d7 feat: establish Phase 3 E2E testing infrastructure with Playwright

Created comprehensive E2E test suite for ITSulu Blog Publisher using Playwright
and Runboat. Includes:

PHASE3_ROADMAP.md:
- Goals for E2E coverage (10-30 scenarios)
- Performance benchmark targets (latency, tokens, queries, throughput)
- Implementation plan with layer-by-layer breakdown
- Success criteria and SLO targets
- Runboat integration details for CI/CD

e2e/ directory structure:
- conftest.py: Runboat polling, auth fixtures, page fixture
- requirements.txt: pytest, playwright, requests
- test_generation.py: On-demand generation workflows (5 tests)
- test_scheduling.py: Schedule slot configuration and execution (6 tests)
- test_error_recovery.py: Error handling and email notifications (8 tests)

Total: 19 E2E test scenarios covering:
- On-demand post generation with auto-publish
- Scheduled generation with topic queue
- Error recovery and retry mechanism
- Email notifications with correct content
- Social media copy generation
- Concurrent post generation
- Progress feedback during API calls

Tests use:
- Playwright sync API with 30s timeout (Odoo JS rendering)
- Runboat polling with 180s timeout (instance cold-start)
- Session-scoped auth to avoid repeated 30s logins
- Data-test-id selectors where available, fallback to get_by_*
- Proper wait_for_load_state() for async operations

Co-Authored-By: Claude Haiku 4.5 <noreply@anthropic.com>
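
The commit message mentions Runboat polling with a 180s timeout to cover instance cold-start. A minimal sketch of such a polling loop follows; it is not the actual contents of `conftest.py`. The names `wait_until_ready` and `http_probe`, the 5-second interval, and the assumption that a ready instance answers a plain HTTP GET with status 200 are all hypothetical.

```python
import time
import urllib.error
import urllib.request
from typing import Callable


def http_probe(url: str) -> Callable[[], bool]:
    """Build a probe that reports True once `url` answers with HTTP 200.

    Assumption: a ready instance responds to a plain GET with status 200.
    """
    def _probe() -> bool:
        try:
            with urllib.request.urlopen(url, timeout=10) as resp:
                return resp.status == 200
        except (urllib.error.URLError, OSError):
            # Connection refused, DNS failure, HTTP error, or socket timeout.
            return False
    return _probe


def wait_until_ready(
    probe: Callable[[], bool],
    timeout_s: float = 180.0,   # cold-start budget from the commit message
    interval_s: float = 5.0,    # hypothetical polling interval
    clock: Callable[[], float] = time.monotonic,
    sleep: Callable[[float], None] = time.sleep,
) -> float:
    """Call `probe` until it returns True and return the elapsed seconds.

    Raises TimeoutError when another wait would exceed `timeout_s`.
    """
    start = clock()
    while True:
        if probe():
            return clock() - start
        if clock() - start + interval_s > timeout_s:
            raise TimeoutError(f"instance not ready within {timeout_s:.0f}s")
        sleep(interval_s)
```

In a session-scoped pytest fixture this could be called once, for example `wait_until_ready(http_probe(base_url))`, where `base_url` is whatever URL the Runboat build exposes. The `clock` and `sleep` parameters exist only so the loop can be exercised without real waiting.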

| Metric | Target | Rationale |
| --- | --- | --- |
| Generation latency (P50) | < 30 seconds | User experience (wizard response time) |
| Generation latency (P99) | < 60 seconds | Outlier tolerance |
| Tokens per post | 800–1200 | Cost baseline for budget planning |
| Queries per generation | < 50 | N+1 detection and DB load |
| Concurrent posts | 5+ | Peak capacity without degradation |
| Email send latency | < 5 seconds | Notification responsiveness |
| Template DB prime time | < 60 seconds | CI/CD pipeline efficiency |
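
The P50 and P99 latency targets above can be checked against measured generation times with a small helper. The sketch below assumes the nearest-rank percentile definition and a list of per-generation latencies in seconds; the function names and the sample values are illustrative, not measurements from the project.

```python
import math
from typing import Dict, Sequence, Union


def percentile(samples: Sequence[float], pct: float) -> float:
    """Nearest-rank percentile: the smallest sample with at least pct% at or below it."""
    if not samples:
        raise ValueError("need at least one sample")
    ordered = sorted(samples)
    rank = max(1, math.ceil(pct / 100 * len(ordered)))
    return ordered[rank - 1]


def check_latency_slo(
    samples_s: Sequence[float],
    p50_target_s: float = 30.0,  # "Generation latency (P50) < 30 seconds"
    p99_target_s: float = 60.0,  # "Generation latency (P99) < 60 seconds"
) -> Dict[str, Union[float, bool]]:
    """Compare measured latencies with the P50 and P99 targets."""
    p50 = percentile(samples_s, 50)
    p99 = percentile(samples_s, 99)
    return {
        "p50": p50,
        "p99": p99,
        "p50_ok": p50 < p50_target_s,
        "p99_ok": p99 < p99_target_s,
    }


# Illustrative latencies in seconds (made-up values, not real measurements).
report = check_latency_slo([12, 18, 22, 25, 27, 29, 31, 35, 41, 58])
print(report)
```

With few samples, the nearest-rank P99 is simply the slowest run, so a single outlier decides the result; a larger sample gives a more meaningful P99.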

| Week | Task | Owner |
| --- | --- | --- |
| W1 | Set up e2e/ directory, conftest.py, Runboat polling | Claude |
| W1 | Implement 3–5 core E2E scenarios (generation, scheduling) | Claude |
| W2 | Add error recovery and email scenarios | Claude |
| W2 | Set up performance measurement (latency, queries) | Claude |
| W3 | Stress testing and concurrency verification | Claude |
| W3 | Performance tuning if SLOs not met | Claude |
| W4 | Runboat CI/CD integration | Claude |
| W4 | Final verification and documentation | Claude |

Phase 3: Runboat E2E Testing and Performance Benchmarks

Goals

1. E2E Test Coverage (10–30 scenarios)

2. Performance Benchmarks

3. Load Testing

Implementation Plan

Layer 1: Runboat Setup & E2E Infrastructure

Layer 2: E2E Test Scenarios (10–20 tests)

Layer 3: Performance Benchmarks

Runboat Integration

What is Runboat?

CI/CD Integration

Success Criteria

Performance SLO Targets

Implementation Timeline

Known Constraints

Runboat Limitations

E2E Test Maintenance

References
