Examples¶
This guide provides practical Aniwa usage examples for common profiling workflows.
These examples demonstrate:
- dataset inspection
- report generation
- configuration usage
- automation workflows
- profiling strategies
Basic Dataset Profiling¶
Profile a CSV dataset:
Profile an Excel dataset:
Profile a JSON dataset:
Profile a Parquet dataset:
Fast Profiling¶
Use lightweight profiling for speed:
Best for:
- quick inspections
- CI validation
- very large datasets
Deep Profiling¶
Run comprehensive profiling:
Best for:
- audits
- production validation
- deep analysis
JSON Reports¶
Generate machine-readable reports:
Use cases:
- automation
- APIs
- pipelines
- integrations
HTML Reports¶
Generate shareable HTML reports:
Best for:
- stakeholders
- sharing
- auditing
- visualization
PDF Reports¶
Generate printable reports:
Best for:
- compliance
- documentation
- management reviews
Markdown Reports¶
Generate GitHub-friendly reports:
Best for:
- documentation
- GitHub issues
- pull requests
Excel Reports¶
Generate spreadsheet reports:
Best for:
- analysts
- spreadsheet workflows
- exports
Using Templates¶
Dark Template¶
Enterprise Template¶
Compact Template¶
Output Directories¶
Generate reports into folders:
Generated output:
Include Specific Sections¶
Generate focused reports:
Exclude Sections¶
Remove unnecessary sections:
Configuration File Examples¶
YAML Workflow¶
aniwa.yaml
mode: deep
report:
format: html
template: enterprise
output_dir: reports/
sections:
include:
- summary
- statistics
- insights
Run:
Custom Config File¶
Automation Examples¶
Nightly Profiling¶
CI Validation¶
Audit Workflow¶
Example Profiling Insights¶
Aniwa can generate insights like:
Example Enterprise Workflow¶
Step 1 — Initial Inspection¶
Step 2 — Generate HTML Report¶
Step 3 — Generate PDF Audit¶
Step 4 — Standardize with Config¶
aniwa.yaml
Step 5 — Automate¶
Example Team Structure¶
Example Data Engineering Workflow¶
Aniwa fits naturally into:
Example Governance Workflow¶
Aniwa can support:
- dataset audits
- trust reviews
- onboarding datasets
- quality validation
- metadata inspection
Future Workflow Examples¶
Future versions of Aniwa may support:
- database profiling
- distributed profiling
- observability integration
- AI-assisted insights
- dataset lineage
- semantic understanding
- trust scoring
Recommended Practice¶
A recommended workflow:
- inspect datasets immediately
- generate reusable reports
- standardize configs
- automate validation
- integrate into CI/CD
Next Steps¶
Continue with:
- cli-reference.md
- profiling-modes.md
- report-formats.md
- architecture/overview.md