The Patra Toolkit is a component of the Patra ModelCards framework designed to simplify the process of creating and documenting AI/ML models. It provides a structured schema that guides users in providing essential information about their models, including details about the model’s purpose, development process, and performance. The toolkit also includes features for semi-automating the capture of key information, such as fairness and explainability metrics, through integrated analysis tools. By reducing the manual effort involved in creating model cards, the Patra Toolkit encourages researchers and developers to adopt best practices for documenting their models, ultimately contributing to greater transparency and accountability in AI/ML development.

Features

Encourages Accountability: Incorporate essential model information (metadata, dataset details, fairness, explainability) at training time, ensuring AI models remain transparent from development to deployment.
Semi-Automated Capture: Automated fairness and explainability scanners compute demographic parity, equal odds, SHAP-based feature importances, etc., for easy integration into Model Cards.
Machine-Actionable Model Cards: Produce a structured JSON representation for ingestion into the Patra Knowledge Base. Ideal for advanced queries on model selection, provenance, versioning, or auditing.
Flexible Repository Support: Pluggable backends for storing models/artifacts on Hugging Face or GitHub, unifying the model publishing workflow.
Versioning & Model Relationship Tracking: Maintain multiple versions of a model with recognized edges (e.g., revisionOf, alternateOf) using embedding-based similarity. This ensures clear lineages and easy forward/backward provenance.
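
To make the idea of embedding-based relationship tracking concrete, here is a minimal sketch that compares two embedding vectors with cosine similarity and suggests an edge label. This is not the toolkit's implementation: the function names and both thresholds are hypothetical, chosen only for illustration, and how Patra actually computes embeddings or assigns revisionOf/alternateOf is not described here.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length numeric vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # treat a zero vector as unrelated to everything
    return dot / (norm_a * norm_b)


# Hypothetical cut-offs, assumed for this sketch only.
REVISION_THRESHOLD = 0.9
ALTERNATE_THRESHOLD = 0.6


def suggest_edge(embedding_a, embedding_b):
    """Suggest an edge label from similarity; None means no edge."""
    score = cosine_similarity(embedding_a, embedding_b)
    if score >= REVISION_THRESHOLD:
        return "revisionOf"
    if score >= ALTERNATE_THRESHOLD:
        return "alternateOf"
    return None
```

In this sketch, near-identical embeddings are read as revisions and moderately similar ones as alternates; a real system would need thresholds tuned on its own data.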

Getting Started

Installing Patra Model Card

The latest version can be installed from PyPI:

pip install patra-toolkit

For local installation, clone the repository and install using:

pip install -e <local_git_dir>/patra_toolkit

Usage

Create a Model Card

Descriptions of the Model Card parameters are in docs/schema_description.md.

from patra_toolkit import ModelCard

mc = ModelCard(
  name="UCI Adult Data Analysis model using Tensorflow",
  version="0.1",
  short_description="UCI Adult Data analysis using Tensorflow for demonstration of Patra Model Cards.",
  full_description="We have trained a ML model using the tensorflow framework to predict income for the UCI Adult Dataset. We leverage this data to run the Patra model cards to capture metadata about the model as well as fairness and explainability metrics.",
  keywords="uci adult, tensorflow, explainability, fairness, patra",
  author="Sachith Withana",
  input_type="Tabular",
  category="classification",
  foundational_model="None"
)

# Add Model Metadata
mc.input_data = 'https://archive.ics.uci.edu/dataset/2/adult'
mc.output_data = 'https://huggingface.co/Data-to-Insight-Center/UCI-Adult'
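
The following sketch illustrates what "machine-actionable" means in practice: the same fields passed to ModelCard above, held in a plain dataclass and serialized to JSON. CardSketch is a hypothetical stand-in written for this example; it is not the Patra schema, and the JSON the toolkit actually produces may be structured differently.

```python
import json
from dataclasses import asdict, dataclass


@dataclass
class CardSketch:
    """Hypothetical stand-in for a model card; not the Patra schema."""
    name: str
    version: str
    short_description: str
    keywords: str
    author: str
    input_type: str
    category: str
    input_data: str = ""
    output_data: str = ""

    def to_json(self) -> str:
        # Sorted keys keep the output stable across runs.
        return json.dumps(asdict(self), indent=2, sort_keys=True)


card = CardSketch(
    name="UCI Adult Data Analysis model using Tensorflow",
    version="0.1",
    short_description="UCI Adult Data analysis for demonstration.",
    keywords="uci adult, tensorflow, explainability, fairness, patra",
    author="Sachith Withana",
    input_type="Tabular",
    category="classification",
)
card.input_data = "https://archive.ics.uci.edu/dataset/2/adult"
card_json = card.to_json()
```

Because the result is ordinary JSON, any consumer can load it with json.loads and query fields such as version or category directly.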

Initialize an AI/ML Model

from patra_toolkit import AIModel

ai_model = AIModel(
  name="UCI Adult Random Forest model",
  version="0.1",
  description="Census classification problem using Random Forest",
  owner="Sachith Withana",
  location="https://github.iu.edu/swithana/mcwork/randomforest/adult_model.pkl",
  license="BSD-3 Clause",
  framework="sklearn",
  model_type="random_forest",
  test_accuracy=accuracy  # accuracy measured on the test set beforehand
)

# Populate Model Structure
ai_model.populate_model_structure(trained_model)
mc.ai_model = ai_model

# Add Custom Metrics
ai_model.add_metric("Test loss", loss)
ai_model.add_metric("Epochs", 100)
ai_model.add_metric("Batch Size", 32)
ai_model.add_metric("Optimizer", "Adam")
ai_model.add_metric("Learning Rate", 0.0001)
ai_model.add_metric("Input Shape", "(26048, 100)")
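
The snippet above assumes that accuracy and loss were computed before the AIModel was created. As a rough sketch of where those numbers could come from, the dependency-free helpers below compute accuracy and binary log loss from toy predictions. The helper names and the sample data are assumptions made for this example; in practice these values would usually come from your training framework's own evaluation step.

```python
import math


def simple_accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    if not y_true or len(y_true) != len(y_pred):
        raise ValueError("inputs must be non-empty and of equal length")
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return correct / len(y_true)


def binary_log_loss(y_true, y_prob, eps=1e-15):
    """Mean negative log-likelihood for binary labels in {0, 1}."""
    if not y_true or len(y_true) != len(y_prob):
        raise ValueError("inputs must be non-empty and of equal length")
    total = 0.0
    for t, p in zip(y_true, y_prob):
        p = min(max(p, eps), 1 - eps)  # clip to avoid log(0)
        total -= t * math.log(p) + (1 - t) * math.log(1 - p)
    return total / len(y_true)


# Toy data, invented for illustration.
y_true = [1, 0, 1, 1]
y_prob = [0.9, 0.2, 0.4, 0.8]
y_pred = [1 if p >= 0.5 else 0 for p in y_prob]

accuracy = simple_accuracy(y_true, y_pred)  # 3 of 4 predictions are correct
loss = binary_log_loss(y_true, y_prob)
```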

Run Fairness and Explainability Scanners

# To assess fairness, provide the test data, labels, predictions, the sensitive feature's name and values, and the trained model
mc.populate_bias(X_test, y_test, predictions, "gender", X_test['sex'], clf)

# To generate explainability metrics, provide the dataset, column names, trained model, and the number of top features to report
mc.populate_xai(X_test, x_columns, model, top_n=10)
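
For readers unfamiliar with the fairness metrics named in the feature list, here is a small sketch of demographic parity difference and equalized odds difference for binary predictions. It follows the common definitions (largest gap between groups in selection rate, and the larger of the gaps in true positive rate and false positive rate). It is not the scanner's code, and the toy data is invented for illustration.

```python
from collections import defaultdict


def _rate(numerator, denominator):
    return numerator / denominator if denominator else 0.0


def group_rates(y_true, y_pred, sensitive):
    """Per-group selection rate, true positive rate, false positive rate."""
    counts = defaultdict(
        lambda: {"n": 0, "pred_pos": 0, "pos": 0, "tp": 0, "neg": 0, "fp": 0}
    )
    for t, p, g in zip(y_true, y_pred, sensitive):
        c = counts[g]
        c["n"] += 1
        c["pred_pos"] += p
        if t == 1:
            c["pos"] += 1
            c["tp"] += p
        else:
            c["neg"] += 1
            c["fp"] += p
    return {
        g: {
            "selection_rate": _rate(c["pred_pos"], c["n"]),
            "tpr": _rate(c["tp"], c["pos"]),
            "fpr": _rate(c["fp"], c["neg"]),
        }
        for g, c in counts.items()
    }


def _gap(rates, key):
    values = [r[key] for r in rates.values()]
    return max(values) - min(values)


def demographic_parity_difference(y_true, y_pred, sensitive):
    return _gap(group_rates(y_true, y_pred, sensitive), "selection_rate")


def equalized_odds_difference(y_true, y_pred, sensitive):
    rates = group_rates(y_true, y_pred, sensitive)
    return max(_gap(rates, "tpr"), _gap(rates, "fpr"))


# Toy labels, predictions, and sensitive attribute values.
labels = [1, 0, 1, 0, 1, 0, 1, 0]
preds = [1, 0, 1, 1, 0, 0, 1, 0]
sex = ["F", "F", "F", "F", "M", "M", "M", "M"]

dpd = demographic_parity_difference(labels, preds, sex)
eod = equalized_odds_difference(labels, preds, sex)
```

A value of 0 for either metric means the groups are treated identically by that measure; larger values indicate a larger disparity.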

Validate and Save the Model Card

# Capture Python package dependencies and versions
mc.populate_requirements()

# Verify the model card content against the schema
mc.validate()
mc.save(<file_path>)
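
To give a sense of what capturing package dependencies involves, the sketch below lists installed distributions and their versions using only the standard library. It is an illustration of the general technique, not the body of populate_requirements(), which may filter or format the list differently; installed_requirements is a hypothetical helper name.

```python
from importlib import metadata


def installed_requirements():
    """Return sorted 'name==version' strings for installed distributions."""
    pins = set()
    for dist in metadata.distributions():
        name = dist.metadata.get("Name")
        if name:  # skip distributions whose metadata lacks a name
            pins.add(f"{name}=={dist.version}")
    return sorted(pins, key=str.lower)


requirements = installed_requirements()
```

The resulting list has the same shape as a pinned requirements.txt, which makes it straightforward to store alongside a model card.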

Submit

When calling mc.submit(), pass the following keyword arguments:

patra_server_url (str, required): Base URL of the Patra Server’s REST API (e.g., "https://patra.example.org").
model (object or filepath, optional): If omitted, only the Model Card JSON is sent. Either:
  - A Python object implementing a .save() method (e.g., a Keras Model); Patra Toolkit will call model.save(…) to serialize it.
  - A local file path (e.g., "./trained_model.pt") to an existing saved model.
file_format (str, optional): File extension/format for saving the model. If you supply model, you must also specify file_format; it is ignored otherwise. Common values:
  - "pt" (PyTorch)
  - "h5" (Keras)
  - "onnx" (ONNX)
model_store (str, optional): Backend for storing the model and artifacts. Not used if no model or artifacts are provided. Valid values:
  - "huggingface": Upload to a Hugging Face repository.
  - "github": Upload to a GitHub repository under the Patra Server organization.
inference_labels (str or list[str], optional): Path(s) to file(s) containing inference labels (e.g., class names). These are uploaded alongside the model.
artifacts (list[str], optional): List of additional file paths to upload (e.g., plots, data files, metrics). Each will be stored in the same repository.
token (str, optional): A valid TAPIS JWT (JSON Web Token) for authentication.
  - For a public Patra Server, omit this parameter.
  - For an authenticated server, a valid token is mandatory; otherwise, you will get an HTTP 401 Unauthorized error.
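
The constraints in the list above can be checked locally before any upload is attempted. The helper below is a hypothetical pre-flight check written for this example; it is not part of the toolkit and only encodes the rules stated above (required URL, file_format required with model, allowed model_store values).

```python
VALID_MODEL_STORES = {"huggingface", "github"}


def check_submit_args(patra_server_url, model=None, file_format=None,
                      model_store=None, inference_labels=None,
                      artifacts=None, token=None):
    """Return a list of problems found; an empty list means none."""
    problems = []
    if not patra_server_url:
        problems.append("patra_server_url is required")
    if model is not None and not file_format:
        problems.append("file_format is required when model is supplied")
    if model_store is not None and model_store not in VALID_MODEL_STORES:
        problems.append(
            f"model_store must be one of {sorted(VALID_MODEL_STORES)}"
        )
    return problems
```

Calling check_submit_args with the same keyword arguments you plan to give mc.submit() returns human-readable messages for anything that violates these rules. It cannot tell whether the server requires a token, so an HTTP 401 can still occur.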

mc.submit(
    patra_server_url=<patra_server_url>,
    model=<trained_model>,
    file_format="pt", # or "h5"
    model_store="huggingface", # or "github"
    inference_labels="labels.txt",
    artifacts=[<artifact1_path>, <artifact2_path>]
)

If a name-version conflict arises, increment mc.version. In case of failure, submit() attempts partial rollbacks to avoid orphaned uploads.
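
One way to increment a version string such as "0.1" is sketched below. bump_version is a hypothetical helper, and it assumes dot-separated versions whose last component is numeric, as in the examples above; how the server reports a conflict is not specified here, so detecting it is left to the caller.

```python
def bump_version(version: str) -> str:
    """Increment the last numeric component of a dotted version string."""
    parts = version.split(".")
    if not parts[-1].isdigit():
        raise ValueError(f"cannot increment non-numeric component: {version!r}")
    parts[-1] = str(int(parts[-1]) + 1)
    return ".".join(parts)
```

After a conflict, setting mc.version = bump_version(mc.version) and calling submit() again would retry under the next version number.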

Examples

Explore the following example notebooks and model cards to learn more about how to use the Patra Model Card Toolkit: Notebook Example, Model Card Example

License

The Patra Model Card toolkit is developed by Indiana University and distributed under the BSD 3-Clause License. See LICENSE.txt for more details.

Acknowledgements

This research is funded in part through the National Science Foundation under award #2112606, AI Institute for Intelligent CyberInfrastructure with Computational Learning in the Environment (ICICLE), and in part through the Data to Insight Center at Indiana University.

Reference

S. Withana and B. Plale, “Patra ModelCards: AI/ML Accountability in the Edge-Cloud Continuum,” 2024 IEEE 20th International Conference on e-Science (e-Science), Osaka, Japan, 2024, pp. 1-10, doi: 10.1109/e-Science62913.2024.10678710.