Analysis Output Principles
Apply these guidelines to all data analysis outputs regardless of domain.
Data Presentation
- Always show the data — Include actual tables with numbers, not just prose summaries
- Calculate rates and percentages — Raw counts are less useful than conversion rates, percentages, and ratios
- Show distributions — Averages hide important variation; include ranges or breakdowns
- Identify outliers — Call out unusual data points that might skew results or warrant investigation
Actionability
- Be specific about recommendations — "Reach out to Acme Corp" not "improve retention"
- Quantify impact — "5 customers at $50K ARR each" not "some customers at risk"
- Prioritize actions — If recommending multiple things, rank by impact or urgency
- Connect to decisions — Frame findings in terms of decisions the user needs to make
Data Quality & Limitations
- Flag data quality issues — Missing columns, sparse data, suspicious values, inconsistent formats
- Acknowledge sample size — If data is sparse (< 20 rows), warn that results may not be statistically significant
- Note parsing assumptions — If dates, currencies, or formats required interpretation, say what you assumed
- State what's missing — What the data can't tell you; what additional data would improve the analysis
Handling CSV Data
Column Detection
- If a required column is missing, ask the user which column to use
- Look for common aliases (e.g., "Status" / "Stage" / "Phase" for pipeline stage)
- Note which columns were used for each analysis dimension
Data Validation
- Check for obvious data issues: negative amounts, dates in the future, impossible percentages
- Count rows with missing values in key columns
- Identify potential duplicates if there's an ID column
Date Handling
- Default to ISO format (YYYY-MM-DD) when ambiguous
- Note timezone assumptions if timestamps are present
- Handle partial dates gracefully (e.g., "Jan 2024" → first day of month)
Output Structure
Every analysis should include:
- Summary — Key metrics at a glance
- Data tables — Actual numbers in tabular form
- Insights — What the data reveals (patterns, outliers, trends)
- Recommendations — 2-3 specific actions based on findings
- Limitations — Data quality notes and what wasn't captured