Tools for Trustworthy AI

Catalogue of Tools & Metrics for Trustworthy AI

These tools and metrics are designed to help AI actors develop and use trustworthy AI systems and applications that respect human rights and are fair, transparent, explainable, robust, secure and safe.

Overview Tools Metrics About the catalogue

Show tools Show use cases

Objective Robustness & digital security

AIxploit

TechnicalFranceUploaded on Dec 6, 2024

AIxploit is a tool designed to evaluate and enhance the robustness of Large Language Models (LLMs) through adversarial testing. This tool simulates various attack scenarios to identify vulnerabilities and weaknesses in LLMs, ensuring they are more resilient and reliable in real-world applications.

Objective(s)

Robustness & digital security Safety

Related lifecycle stage(s)

Operate & monitor Verify & validate

Adversa: AI Red Teaming Platform

TechnicalUploaded on Dec 6, 2024

Continuous proactive AI red teaming platform for AI and GenAI models, applications and agents.

Objective(s)

Privacy & data governance Robustness & digital security

Related lifecycle stage(s)

Verify & validate Build & interpret model Plan & design

Vectice

TechnicalProceduralUnited StatesUploaded on Dec 6, 2024

Vectice is a regulatory MLOps platform for AI/ML developers and validators that streamlines documentation, governance, and collaborative reviewing of AI/ML models. Designed to enhance audit readiness and ensure regulatory compliance, Vectice automates model documentation, from development to validation. With features like automated lineage tracking and documentation co-pilot, Vectice empowers AI/ML developers and validators to work in their favorite environment while focusing on impactful work, accelerating productivity, and reducing risk.

Objective(s)

Robustness & digital security Transparency & explainability

Related lifecycle stage(s)

Deploy Verify & validate Build & interpret model

Mindgard

TechnicalUnited KingdomUploaded on Dec 6, 2024

Continuous automated red teaming for AI, minimize security threats to AI models and applications.

Objective(s)

Robustness & digital security

Related lifecycle stage(s)

Operate & monitor Deploy Verify & validate

PyRIT

TechnicalUnited StatesUploaded on Nov 8, 2024

The Python Risk Identification Tool for generative AI (PyRIT) is an open access automation framework to empower security professionals and machine learning engineers to proactively find risks in their generative AI systems.

Objective(s)

Robustness & digital security Safety

Related lifecycle stage(s)

Operate & monitor Verify & validate

Resaro

ProceduralSingaporeUploaded on Oct 2, 2024

Resaro offers independent, third-party assurance of mission-critical AI systems. It promotes responsible, safe and robust AI adoption for enterprises, through technical advisory and evaluation of AI systems against emerging regulatory requirements.

Objective(s)

Robustness & digital security Transparency & explainability

FairNow: AI Governance Platform

ProceduralUploaded on Oct 2, 2024

FairNow is an AI governance software tool that simplifies and centralises AI risk management at scale. To build and maintain trust with customers, organisations must conduct thorough risk assessments on their AI models, ensuring compliance, fairness, and security. Risk assessments also ensure organisations know where to prioritise their AI governance efforts, beginning with high-risk models and use cases.

Objective(s)

Accountability Robustness & digital security

garak

TechnicalUploaded on Nov 5, 2024

garak, Generative AI Red-teaming & Assessment Kit, is an LLM vulnerability scanner. Garak checks if an LLM can be made to fail.

Objective(s)

Robustness & digital security Safety

Related lifecycle stage(s)

Operate & monitor Verify & validate

HarmBench

TechnicalInternationalUploaded on Nov 5, 2024

A fast, scalable, and open-source framework for evaluating automated red teaming methods and LLM attacks/defenses. HarmBench has out-of-the-box support for transformers-compatible LLMs, numerous closed-source APIs, and several multimodal models.

Objective(s)

Performance Robustness & digital security

Responsible innovation toolkit: Harms modeling framework

TechnicalUnited StatesUploaded on Sep 9, 2024

Harms Modeling is a practice designed to help you anticipate the potential for harm, identify gaps in product that could put people at risk, and ultimately create approaches that proactively address harm.

Objective(s)

Robustness & digital security Safety

Dioptra

TechnicalUnited StatesUploaded on Sep 9, 2024

Dioptra is an open source software test platform for assessing the trustworthy characteristics of artificial intelligence (AI). It helps developers on determining which types of attacks may impact negatively their model's performance.

Objective(s)

Robustness & digital security Safety

Related lifecycle stage(s)

Operate & monitor Verify & validate Build & interpret model

BELLS - Benchmarks for the Evaluation of LLM Safeguards

TechnicalFranceUploaded on Aug 2, 2024

Evaluate input-output safeguards for LLM systems such as jailbreak and hallucination detectors, to understand how good they are and on which type of inputs they fail.

Objective(s)

Robustness & digital security Safety Transparency & explainability

Related lifecycle stage(s)

Operate & monitor Verify & validate

Probe: End-to-end AI Security Platform

TechnicalUnited StatesUploaded on Aug 2, 2024

AI Security Platform for GenAI and Conversational AI applications. Probe enables security officers and developers identify, mitigate, and monitor AI system security.

Objective(s)

Performance Robustness & digital security Safety

Related lifecycle stage(s)

Operate & monitor Verify & validate

DIN SPEC 92001-2 - Artificial Intelligence - Life Cycle Processes and Quality Requirements - Part 2: Robustness

ProceduralUploaded on Jul 2, 2024

The DIN SPEC series describes a number of AI quality requirements which are structured using an AI quality meta model. The DIN SPEC series applies to all phases of the life cycle of an AI module.

Objective(s)

Robustness & digital security

IEEE 2801-2022 - IEEE Recommended Practice for the Quality Management of Datasets for Medical Artificial Intelligence

ProceduralUploaded on Jul 2, 2024

The document highlights quality objectives for organizations responsible for datasets. The document describes control of records during the lifecycle of datasets, including but not limited to data collection, annotation, transfer, utilization, storage, maintenance, updates, retirement, and other activities.

Objective(s)

Robustness & digital security

IEEE 2830-2021 - IEEE Standard for Technical Framework and Requirements of Trusted Execution Environment based Shared Machine Learning

ProceduralUploaded on Jul 2, 2024

This standard defines a framework and architectures for machine learning in which a model is trained using encrypted data that has been aggregated from multiple sources and is processed by a third party trusted execution environment (TEE).

Objective(s)

Robustness & digital security

IEEE 3333.1.3-2022 - IEEE Standard for the Deep Learning-Based Assessment of Visual Experience Based on Human Factors

ProceduralUploaded on Jul 2, 2024

In this standard, quality of experience (QoE) assessment is categorized into two subcategories which are perceptual quality and virtual reality (VR) cybersickness.

Objective(s)

Robustness & digital security

ISO/IEC TR 24027:2021 - Information technology - Artificial intelligence (AI) - Bias in AI systems and AI aided decision making

ProceduralUploaded on Jul 3, 2024

This document addresses bias in relation to AI systems, especially with regards to AI-aided decision-making.

Objective(s)

Robustness & digital security

ETSI GR SAI 001 V 1.1.1 - Securing Artificial Intelligence (SAI) - AI Threat Ontology

ProceduralUploaded on Jul 1, 2024

The purpose of this work item is to define what would be considered an AI threat and how it might differ from threats to traditional systems.

Objective(s)

Robustness & digital security

ETSI GR SAI 005 V 1.1.1 - Securing Artificial Intelligence (SAI) - Mitigation Strategy Report

ProceduralUploaded on Jun 28, 2024

This work item aims to summarize and analyze existing and potential mitigation against threats for AI-based systems.

Objective(s)

Robustness & digital security

Disclaimer: The tools and metrics featured herein are solely those of the originating authors and are not vetted or endorsed by the OECD or its member countries. The Organisation cannot be held responsible for possible issues resulting from the posting of links to third parties' tools and metrics on this catalogue. More on the methodology can be found at https://oecd.ai/catalogue/faq.

Type

Origin

Scope

SUBMIT A TOOL

AIxploit

Adversa: AI Red Teaming Platform

Vectice

Mindgard

PyRIT

Resaro

FairNow: AI Governance Platform

garak

HarmBench

Responsible innovation toolkit: Harms modeling framework

Dioptra

BELLS - Benchmarks for the Evaluation of LLM Safeguards

Probe: End-to-end AI Security Platform

DIN SPEC 92001-2 - Artificial Intelligence - Life Cycle Processes and Quality Requirements - Part 2: Robustness

IEEE 2801-2022 - IEEE Recommended Practice for the Quality Management of Datasets for Medical Artificial Intelligence

IEEE 2830-2021 - IEEE Standard for Technical Framework and Requirements of Trusted Execution Environment based Shared Machine Learning

IEEE 3333.1.3-2022 - IEEE Standard for the Deep Learning-Based Assessment of Visual Experience Based on Human Factors

ISO/IEC TR 24027:2021 - Information technology - Artificial intelligence (AI) - Bias in AI systems and AI aided decision making

ETSI GR SAI 001 V 1.1.1 - Securing Artificial Intelligence (SAI) - AI Threat Ontology

ETSI GR SAI 005 V 1.1.1 - Securing Artificial Intelligence (SAI) - Mitigation Strategy Report