pm4py.algo.clustering.trace_attribute_driven package#

PM4Py – A Process Mining Library for Python

Copyright (C) 2024 Process Intelligence Solutions UG (haftungsbeschränkt)

This program is free software: you can redistribute it and/or modify it under the terms of the GNU Affero General Public License as published by the Free Software Foundation, either version 3 of the License, or any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Affero General Public License for more details.

You should have received a copy of the GNU Affero General Public License along with this program. If not, see this software project’s root or visit <https://www.gnu.org/licenses/>.

Website: https://processintelligence.solutions Contact: info@processintelligence.solutions

Subpackages#

Submodules#

pm4py.algo.clustering.trace_attribute_driven.algorithm module#

PM4Py – A Process Mining Library for Python

Copyright (C) 2024 Process Intelligence Solutions UG (haftungsbeschränkt)

This program is free software: you can redistribute it and/or modify it under the terms of the GNU Affero General Public License as published by the Free Software Foundation, either version 3 of the License, or any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Affero General Public License for more details.

You should have received a copy of the GNU Affero General Public License along with this program. If not, see this software project’s root or visit <https://www.gnu.org/licenses/>.

Website: https://processintelligence.solutions Contact: info@processintelligence.solutions

class pm4py.algo.clustering.trace_attribute_driven.algorithm.Variants(value, names=<not given>, *values, module=None, qualname=None, type=None, start=1, boundary=None)[source]#

Bases: Enum

VARIANT_DMM_LEVEN(percent, alpha)#
VARIANT_AVG_LEVEN(percent, alpha)#
VARIANT_DMM_VEC(percent, alpha)#
VARIANT_AVG_VEC(percent, alpha)#
DFG = <module 'pm4py.algo.clustering.trace_attribute_driven.dfg.dfg_dist' from 'C:\\Users\\berti\\pm4py-core\\pm4py\\algo\\clustering\\trace_attribute_driven\\dfg\\dfg_dist.py'>#
pm4py.algo.clustering.trace_attribute_driven.algorithm.bfs(tree)[source]#
pm4py.algo.clustering.trace_attribute_driven.algorithm.apply(log: ~pm4py.objects.log.obj.EventLog | ~pm4py.objects.log.obj.EventStream | ~pandas.core.frame.DataFrame, trace_attribute: str, variant=<function eval_DMM_leven>, parameters: ~typing.Dict[~typing.Any, ~typing.Any] | None = None) Any[source]#

Apply the hierarchical clustering to a log starting from a trace attribute.

MSc Thesis is available at: https://www.pads.rwth-aachen.de/global/show_document.asp?id=aaaaaaaaalpxgft&download=1 Defense slides are available at: https://www.pads.rwth-aachen.de/global/show_document.asp?id=aaaaaaaaalpxgqx&download=1

Parameters#

log

Log

trace_attribute

Trace attribute to exploit for the clustering

variant

Variant of the algorithm to apply, possible values: - Variants.VARIANT_DMM_LEVEN (that is the default) - Variants.VARIANT_AVG_LEVEN - Variants.VARIANT_DMM_VEC - Variants.VARIANT_AVG_VEC - Variants.DFG

Returns#

tree

Hierarchical cluster tree

leafname

Root node