Overview
Python library for conveniently constructing and executing Machine Learning (ML) pipelines represented by Knowledge Graphs (KGs). It features a coding interface and a CLI, and allows the user to:
- Construct an ML pipeline that gets a CSV as input and processes the data using any of the available tasks and methods.
- Save the constructed pipeline as a KG in Turtle format.
- Execute the generated KG.
The coding interface is demonstrated with three sample Python files. The pipelines represented by the generated sample KGs are briefly explained below:
- ML pipeline: Loads features and labels from an input CSV dataset, splits the data, trains and tests a k-NN model, and visualizes the prediction errors.
- Statistics pipeline: Loads a feature from an input CSV dataset, normalizes it, and plots its values (before and after normalization) using a scatter plot.
- Visualization pipeline: Loads a feature from an input CSV dataset and plots its values using a line plot.
Under the hood, ExeKGLib uses well-known Python libraries for data processing and visualization and performing predictions such as pandas, matplotlib, and scikit-learn.
ExeKGLib is described in the following paper published as part of ESWC 2023:
Klironomos A., Zhou B., Tan Z., Zheng Z., Gad-Elrab M., Paulheim H., Kharlamov E. ExeKGLib: Knowledge Graphs-Empowered Machine Learning Analytics
Last update: January 7, 2023