This guide will have you analyzing healthcare processes with AI insights in under 10 minutes.
Contributors:
No installation required!
Run all cells (Runtime → Run all)
Done! You’ll see:
# 1. Clone the repository (30 seconds)
git clone https://github.com/ki-smile/HealthProcessAI.git
cd HealthProcessAI
# 2. Install dependencies (2 minutes)
pip install pm4py pandas numpy matplotlib requests
# 3. Run quick example (1 minute)
python examples/quickstart_example.py
Sample Output:
✓ Loaded 1,000 events from 150 sepsis cases
✓ Discovered 12 distinct patient pathways
✓ Generated process visualization
✓ Most common pathway (34% of patients):
Admission → Triage → Blood Test → High Temperature → Antibiotics → Recovery
📊 Key Insights:
- Average case duration: 18.5 hours
- Early antibiotic administration reduces complications by 23%
- Temperature monitoring is critical within first 6 hours
Visual Output:
Here’s the complete code that runs in our quick start:
# Quick Start Example - Sepsis Process Analysis
from core.step1_data_loader import EventLogLoader
from core.step2_process_mining import ProcessMiner
# 1. Load sample data (15 seconds)
print("🔄 Loading sepsis progression data...")
loader = EventLogLoader("data/sepsisAgregated_Infection.csv")
data = loader.load_data()
prepared_data = loader.prepare_data()
# 2. Get quick statistics
stats = loader.get_statistics()
print(f"📊 Dataset: {stats['num_events']:,} events from {stats['num_cases']} patients")
print(f"📈 Sepsis rate: {stats.get('sepsis_rate', 0):.1%}")
# 3. Discover patient pathways (30 seconds)
print("🔍 Discovering patient pathways...")
miner = ProcessMiner()
event_log = miner.create_event_log(prepared_data)
dfg, starts, ends = miner.discover_dfg()
# 4. Find common pathways
variants = miner.discover_variants(top_k=3)
print("\n🏥 Top 3 Patient Pathways:")
for i, row in variants.iterrows():
print(f" {i+1}. {row['percentage']:.1f}% of patients ({row['cases']} cases)")
pathway = row['variant'].split(' → ')[:4] # Show first 4 steps
print(f" {' → '.join(pathway)}...")
# 5. Generate quick insights
print(f"\n💡 Key Findings:")
print(f"- Found {len(dfg)} unique activity transitions")
print(f"- Most common start: {list(starts.keys())[0]}")
print(f"- Most common end: {list(ends.keys())[0]}")
print(f"- Process coverage: Top 3 pathways cover {variants['percentage'].head(3).sum():.1f}% of cases")
print("\n✅ Quick analysis complete! Check generated files for visualizations.")
👩⚕️ → Clinician Tutorial
🐍 → Python Tutorial
1. Import Errors
# Solution: Install missing packages
pip install pm4py pandas numpy
2. Visualization Not Showing
# Solution: Install Graphviz
# Windows: Download from graphviz.org
# Mac: brew install graphviz
# Linux: apt-get install graphviz
3. Data File Not Found
# Solution: Check file path
import os
print("Current directory:", os.getcwd())
print("Data files:", os.listdir("data/"))
Dataset Overview:
- Total events: 15,214
- Unique patients: 1,050
- Unique activities: 16
- Sepsis rate: 31.4%
- Date range: 2020-01-01 to 2023-12-31
1. 34.2% of patients (359 cases)
Admission → Triage → Blood Culture → High Temperature → Antibiotics
2. 18.7% of patients (196 cases)
Admission → Triage → Lab Test → Normal Temperature → Discharge
3. 12.4% of patients (130 cases)
Admission → High Temperature → Infection → ICU Transfer → Recovery
💡 Clinical Insights Generated:
- Average progression time from admission to sepsis: 14.2 hours
- Early warning indicators: Temperature elevation + Lab abnormalities
- Critical intervention window: First 6 hours post-admission
- Successful treatment pattern: Rapid antibiotic administration
In just 10 minutes, you’ve:
✅ Loaded real healthcare data (sepsis progression events) ✅ Discovered process patterns (patient pathways through care) ✅ Generated visualizations (process flow diagrams) ✅ Extracted insights (clinical patterns and timings) ✅ Learned the framework (ready for advanced analysis)
This quick start demonstrates techniques applicable to:
🎉 Congratulations! You’re now ready to apply process mining to healthcare data.
Choose your next tutorial based on your role and interests:
This quick start guide is part of HealthProcessAI - Developed at SMAILE, Karolinska Institutet