登录查看更多内容

Mining Jira with Jupyter: Extracting and rendering hidden process information with Jira API and Python

William Kennedy

Heat + Pressure + Data = ??

发布日期: 2021年12月17日

For many of us, Jira is the "Source of Truth" for all Planning, Development, Quality Assurance and even the Triage and Resolution of support and feature requests.

We define Processes and build Workflows, sometimes in tandem. Performing Retrospectives on these to gauge alignment or find fault in the resulting processes-at-scale is hard to do, and a roadmap to perform one isn't clearly defined. This article alone won't change that, but I do hope to fan the flames of Process Discovery and Mapping for Jira.

Here are the problems as I saw them:

I needed to get a sense of where cross-functional and cross-organization transactions were taking place, using the constructs of a Jira Project (IssueType, Status, Resolution, Creation Date, et cetera)
I needed to extract the necessary data from Jira to tell the story as part of a data pipeline, not a one-off "Click-a-thon", or static pane in a report
I wanted to bypass heavy tools like Tableau or Looker
Cloud and SaaS tools were off the table
The use cases and required data were likely to change often
Personal choice to make something to share and empower other teams to address other challenges
Personal choice to make the solution something that can be embodied as code, committed, versioned and offer some control via permission

Before continuing, I'd like to acknowledge the work that inspired this article. By combining the approaches in these articles, I have been able to make and share some eye opening discoveries about the status of my Jira Projects, Issues and the dynamics of the teams working there.

Process Mining in Jira by Kjell Tore Guttormsen

https://www.dhirubhai.net/pulse/process-mining-jira-kjell-tore-guttormsen/

Using Jupyter Notebooks to Access Jira by Michael March/Isos Technology

https://blog.isostech.com/atlassian/using-jupyter-notebooks-to-access-jira

Reading and Visualizing Data from Jira by Sergei Dmitriev

https://towardsdatascience.com/communication-story-from-an-issue-tracking-software-efbbf29736ff

Tools

As our careers advance and we face different challenges in Computer Science and Information Technology, we often solve the problem with software. The advancement of the science and software you discover on one project may weave its way into a future challenge and offer new capabilities. That's what happened here.

Disco: While following up on the state of Customer Journey Mapping tools like XESame, I came across the Process Mining article above. In it, Kjell discussed their challenge, some findings and a tantalizing diagram produced with Fluxicon's Disco software based on Jira data like: issue key, summary, assignee, date of status transition, status, department, and a few other properties.

Jupyter: The Disco article discussed using SQL queries to extract the necessary data, something that would make some Admins seethe. Not an option here, either, and while there are several apps for Jira like Midori Better Exporter, which can export the desired changelog data, it's not enough to carry a recommendation to Production. I prefer to use the API and the Python Jira API wrapper delivers on the general utility and appeal for other use cases. IPython/Jupyter notebooks have been around for almost a decade now, and I've always looked for a practical application for them, and lo!

领英推荐

How to Create + Execute a Successful Camunda Proof of…

Camunda 1 年前

10X Engineering - Schema Driven Development: The first…

Ayush Ghai 11 个月前

AI… The speed of things to come

Jazz Kang 2 个月前

!pip install jira

More Jupyter & hello to Gephi: Michael's article (though lacking a Notebook to deploy) offers a good example of how to get started using Jupyter and Jira together. Tip: If you're looking for a quick way to go from 0 to Jira data in a Pandas Dataframe, consider his Docker method.

If you have a Jupyter environment (even if you're a Cloud customer) you can step up your analysis by exploring Sergei's excellent step-by-step example (there are a few sneaky code edits to make that aren't discussed in the article - I will try to comment on them below) to get up and running with a Jira data extract containing the desired changelog data:

# Read data from Jira with changelog

jira_search = jira.search_issues(jql, startAt=block_num*block_size, maxResults=block_size, 
                                 fields="issuetype, created, resolutiondate, reporter, assignee, status", 
                                 expand='changelog')


# Get information from changelog
history_assignee = []
histories = issue.raw['changelog'].get('histories', None)
if histories is not None:
    for history in histories:
        for item in history['items']:
            if item['field'] == 'assignee':
                # Get history author, previous assignee, new assignee
                history_author = history.get('author', None)
                if history_author is not None:
                    history_author = history_author['key']
                history_assignee.append([history_author, item['from'], item['to'], datetime.strptime(history['created'][:19], "%Y-%m-%dT%H:%M:%S")])g

By using?search_issues?method we can read not only fields, but also, for example, changelog. For doing this, parameter?expand='changelog'?must be passed. Changelog is interesting in terms of statuses or assignees history. Suppose, we would like to know assignees history for a given issue (who made change, previous assignee, new assignee, change date and time)

This change and history data in CSV can be used not only in Disco, but the example here uses the NetworkX library to generate a GraphML file which can be in the (frankly very entertaining) Gephi tool, where it is possible to explore and filter these data from Jira in very insightful ways not currently possible (or desirable) in the app:

Looking ahead

The appoach shows a lot of promise as a way for Admin and Consulting teams to deliver and share tools with a lot of value, particularly for situations where out-of-the box software either is cost-prohibitive or too limited.

We often "meet users where they are" which can mean Excel & CSV files and Jira Filters.

Even if you only ever pull Issues out into a table or make a call to your favorite REST APIs, you can save the notebook, copy it, commit it to a git repository, share it with a friend, post it to Slack, email it, blog about it or forget about it until next year when you might have otherwise forgotten how you built it (or the Confluence page wasn't sufficient).

Speaking of Confluence, did you know you can also view Jupyter Notebooks there?

Additional Links

https://pypi.org/project/jira/
https://gephi.org/
https://ipython.org/
https://marketplace.atlassian.com/apps/1214144/jupyter-viewer-for-confluence?hosting=cloud&tab=overview

William Kennedy

Heat + Pressure + Data = ??

3 年

Here is a GitHub repo containing the GraphML example with the edits made that I mentioned. You can also refer to this pasted-together version of Sergei's code if you want to learn along. Happy hacking! https://github.com/wjkennedy/JupyterNotebooks/blob/master/JiraJupyterPython-Gephi_sdmitriev.ipynb

要查看或添加评论，请登录

William Kennedy的更多文章

Data, Algorithms, Analysis, Storage & Me

2025年1月9日

Data, Algorithms, Analysis, Storage & Me

It all started with the phone book. It was my “booster seat” at the dinner table and our Bell Yellow Pages featured…
Incident Management, Support Requests and Me

2024年12月31日

Incident Management, Support Requests and Me

I've been doing incident management and ITSM systems for a long time now. The first one came in the form of a PHP tool…
A9 Arcade: Microservices, GPTs, Agents, SPAs, Automations, One-offs, One-liners and more.

2024年6月13日

A9 Arcade: Microservices, GPTs, Agents, SPAs, Automations, One-offs, One-liners and more.

After a quick win building a general purpose Streamlit app to tackle a data translation challenge, I went for a few…

1 条评论
Sorting my toolbox

2024年3月22日

Sorting my toolbox

I'm looking through my Trello boards to see what might need revisiting or permanent deletion and I thought I'd share my…
Knowledge is Power: GPT Pro Knowledge for Atlassian Solutions

2024年3月17日

Knowledge is Power: GPT Pro Knowledge for Atlassian Solutions

What Lately I've been experimenting with knowledge files and LLMs for RAG. One of the interesting problems is expert…
Mastering Traceability and Observability with MELT: Your Guide to Squashing Heisenbugs

2023年10月18日

Mastering Traceability and Observability with MELT: Your Guide to Squashing Heisenbugs

Introduction: The Elusive Heisenbugs We've all been there. You're in a Project Management Office (PMO) or Engineering…
Reason to migrate to Jira Cloud #487 or 'Understanding the Network Effect and Decay in Software Development: A Case Study of Jira'

2023年7月7日

Reason to migrate to Jira Cloud #487 or 'Understanding the Network Effect and Decay in Software Development: A Case Study of Jira'

2023-07-05 21:04:31 Executing: /usr/share/atlassian-plugin-sdk-8.2.
The A9 Jira Data Generation Cookbook or "It's 2023, do you know where your example Jira data is?"

2023年7月6日

The A9 Jira Data Generation Cookbook or "It's 2023, do you know where your example Jira data is?"

If you've use the Jira data generator on server or data center, you know it has powerful metadata generation…
Retool and Jira: A perfect pair

2021年7月15日

Retool and Jira: A perfect pair

What could make Jira better? Perhaps a way to quickly build and share apps built on Jira data and blended data sources?…
Integrating Jira and Google Data Studio - Part 2

2021年2月12日

Integrating Jira and Google Data Studio - Part 2

This is the second article in a series on using Google Data Studio with Jira Cloud. See the first installment…

See all articles

Mining Jira with Jupyter: Extracting and rendering hidden process information with Jira API and Python

William Kennedy

Heat + Pressure + Data = ??

Process Mining in Jira by Kjell Tore Guttormsen

Using Jupyter Notebooks to Access Jira by Michael March/Isos Technology

Reading and Visualizing Data from Jira by Sergei Dmitriev

Tools

领英推荐

Looking ahead

Additional Links

William Kennedy的更多文章

社区洞察

其他会员也浏览了

Alternatives to Kustomize for Kubernetes Configuration Management

CRken & GitLab: Perfect Pair for AI Code Maintenance

A Comprehensive Guide to Logging: From Beginner to Advanced

AI-Driven Code Automation: Are Developers Being Replaced?

Navigating Docker Entrypoint Script Issues: A Guide to Permissions and Best Practices

UpTeam Pulse #2

Building a Slack Bot with Python and Flask for Kubernetes Management

Modern Python project management with uv and Databricks Asset Bundles

Is AI Code Automation Contributing to Code Complexity?

Service Mesh Explained in 5 minutes for Developers

Process Mining in Jira by Kjell Tore Guttormsen

Using Jupyter Notebooks to Access Jira by Michael March/Isos Technology

Reading and Visualizing Data from Jira by Sergei Dmitriev

Tools

领英推荐

Looking ahead

Additional Links

William Kennedy的更多文章

Data, Algorithms, Analysis, Storage & Me

Incident Management, Support Requests and Me

A9 Arcade: Microservices, GPTs, Agents, SPAs, Automations, One-offs, One-liners and more.

Sorting my toolbox

Knowledge is Power: GPT Pro Knowledge for Atlassian Solutions

Mastering Traceability and Observability with MELT: Your Guide to Squashing Heisenbugs

Reason to migrate to Jira Cloud #487 or 'Understanding the Network Effect and Decay in Software Development: A Case Study of Jira'

The A9 Jira Data Generation Cookbook or "It's 2023, do you know where your example Jira data is?"

Retool and Jira: A perfect pair

Integrating Jira and Google Data Studio - Part 2

社区洞察

其他会员也浏览了

Alternatives to Kustomize for Kubernetes Configuration Management

CRken & GitLab: Perfect Pair for AI Code Maintenance

A Comprehensive Guide to Logging: From Beginner to Advanced

AI-Driven Code Automation: Are Developers Being Replaced?

Navigating Docker Entrypoint Script Issues: A Guide to Permissions and Best Practices

UpTeam Pulse #2

Building a Slack Bot with Python and Flask for Kubernetes Management

Modern Python project management with uv and Databricks Asset Bundles

Is AI Code Automation Contributing to Code Complexity?

Service Mesh Explained in 5 minutes for Developers