This repository contains a tutorial on process mining using PM4Py and JIRA data. Process mining is a technique that allows organizations to analyze their business processes based on event logs extracted from their information systems.
This tutorial guides you through the fundamentals of process mining and demonstrates how to apply these techniques to JIRA data. By analyzing JIRA changelog data, you can discover, monitor, and improve real processes by extracting knowledge from event logs.
- Python 3.7+
- Visual Studio Code
- Basic understanding of data analysis concepts
-
Install Visual Studio Code Download Visual Studio Code from the official website: https://code.visualstudio.com/
-
Add Cody AI extension (optional but recommended)
- In VS Code, click on the Extensions icon in the left sidebar
- In the search bar, type "Cody AI"
- Look for the Cody AI extension by Sourcegraph
- Click the "Install" button
- Configure Cody AI following any prompts provided by the extension
-
Install required Python packages
pip install pm4py pandas matplotlib jupyter
This tutorial covers:
-
Introduction to Process Mining
- Basic concepts and terminology
- Applications and use cases
-
Working with PM4Py
- Setting up the environment
- Core functions and capabilities
-
Extracting and Preparing JIRA Data
- Accessing JIRA changelog data
- Transforming JIRA data into event logs
-
Process Discovery
- Creating process models from event logs
- Visualizing process flows
-
Process Analysis
- Identifying bottlenecks
- Analyzing process variants
- Performance metrics
-
Advanced Techniques
- Conformance checking
- Process enhancement
- Clone this repository
- Open the
intro_to_process_mining_tutorial.ipynbnotebook in Visual Studio Code - Follow the step-by-step instructions in the notebook
The repository includes sample JIRA changelog data in JSON format that you can use to follow along with the tutorial. This data represents typical issue lifecycle events in a software development project.