pm4py.statistics.rework.cases.pandas package

PM4Py – A Process Mining Library for Python

Copyright (C) 2024 Process Intelligence Solutions UG (haftungsbeschränkt)

This program is free software: you can redistribute it and/or modify it under the terms of the GNU Affero General Public License as published by the Free Software Foundation, either version 3 of the License, or any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Affero General Public License for more details.

You should have received a copy of the GNU Affero General Public License along with this program. If not, see this software project’s root or visit <https://www.gnu.org/licenses/>.

Website: https://processintelligence.solutions Contact: info@processintelligence.solutions

Submodules

pm4py.statistics.rework.cases.pandas.get module

class pm4py.statistics.rework.cases.pandas.get.Parameters(value, names=<not given>, *values, module=None, qualname=None, type=None, start=1, boundary=None)

Bases: Enum

ACTIVITY_KEY = 'pm4py:param:activity_key'
CASE_ID_KEY = 'pm4py:param:case_id_key'

pm4py.statistics.rework.cases.pandas.get.apply(df: DataFrame, parameters: Dict[str | Parameters, Any] | None = None) -> Dict[str, Dict[str, int]]

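Computes, for each trace (case) of the event log, how much rework occurs. Rework is the difference between the total number of activities in a trace (its number of events) and the number of unique activities. For example, a trace with the activities A, B, A, C, A has 5 events and 3 unique activities, so its rework is 2.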
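
The definition above can be illustrated for a single trace in plain Python. This is only a minimal sketch of the arithmetic, not the library's implementation; the function name `trace_rework` and the dictionary keys `total_events` and `rework` are hypothetical names chosen for this example.

```python
from typing import Dict, List


def trace_rework(activities: List[str]) -> Dict[str, int]:
    """Return the event count and rework of one trace.

    Hypothetical helper for illustration only; it is not part of pm4py.
    """
    total = len(activities)          # total number of events in the trace
    distinct = len(set(activities))  # number of unique activities
    return {"total_events": total, "rework": total - distinct}


# A trace in which activity "A" is executed three times.
print(trace_rework(["A", "B", "A", "C", "A"]))
```

A trace without repeated activities has a rework of 0 under this definition.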

Parameters

df

Pandas DataFrame containing the event log

parameters

Parameters of the algorithm, including:
- Parameters.ACTIVITY_KEY => the activity key
- Parameters.CASE_ID_KEY => the case identifier attribute

Returns

dict

Dictionary associating to each case ID:
- The number of total activities of the case (number of events)
- The rework (difference between the total number of activities of a trace and the number of unique activities)
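
To make the shape of such a per-case result concrete, the following pandas-only sketch computes the same two quantities with a `groupby`. It is an assumption-laden illustration rather than the code behind `apply`: the function name `rework_per_case` and the inner keys `total_events` and `rework` are hypothetical, and the default column names assume the usual XES-style columns `case:concept:name` and `concept:name`.

```python
from typing import Dict

import pandas as pd


def rework_per_case(
    df: pd.DataFrame,
    case_id_key: str = "case:concept:name",  # assumed case identifier column
    activity_key: str = "concept:name",      # assumed activity column
) -> Dict[str, Dict[str, int]]:
    """Hypothetical sketch: event count and rework for every case."""
    grouped = df.groupby(case_id_key)[activity_key]
    total = grouped.size()        # number of events per case
    distinct = grouped.nunique()  # number of unique activities per case
    return {
        case: {
            "total_events": int(total[case]),
            "rework": int(total[case] - distinct[case]),
        }
        for case in total.index
    }


# Small illustrative log: case "c1" repeats activity "A", case "c2" has no repeats.
log = pd.DataFrame(
    {
        "case:concept:name": ["c1", "c1", "c1", "c1", "c1", "c2", "c2"],
        "concept:name": ["A", "B", "A", "C", "A", "A", "B"],
    }
)
result = rework_per_case(log)
print(result)
```

Passing the column names as arguments plays the same role here that `Parameters.CASE_ID_KEY` and `Parameters.ACTIVITY_KEY` play in the `parameters` dictionary documented above.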