Catalogue of Tools & Metrics for Trustworthy AI

These tools and metrics are designed to help AI actors develop and use trustworthy AI systems and applications that respect human rights and are fair, transparent, explainable, robust, secure and safe.

Type

Performance

Clear all

Origin

Scope

SUBMIT A TOOL

If you have a tool that you think should be featured in the Catalogue of AI Tools & Metrics, we would love to hear from you!

SUBMIT
Objective Performance

EducationalUnited KingdomUploaded on Dec 9, 2024
PLIM is designed to make benchmarking and continuous monitoring of LLMs safer and more fit for purpose. This is particularly important in high-risk environments, e.g. healthcare, finance, insurance and defence. Having community-based prompts to validate models as fit for purpose is safer in a world where LLMs are not static. 

Objective(s)


TechnicalInternationalUploaded on Dec 6, 2024
Evaluating machine learning agents on machine learning engineering.

Objective(s)


TechnicalInternationalUploaded on Nov 5, 2024
A fast, scalable, and open-source framework for evaluating automated red teaming methods and LLM attacks/defenses. HarmBench has out-of-the-box support for transformers-compatible LLMs, numerous closed-source APIs, and several multimodal models.

TechnicalUnited StatesUploaded on Aug 2, 2024
AI Security Platform for GenAI and Conversational AI applications. Probe enables security officers and developers identify, mitigate, and monitor AI system security.

ProceduralUploaded on Jul 1, 2024
The MPAI AI Framework (MPAI-AIF) Technical Specification specifies architecture, interfaces, protocols and Application Programming Interfaces (API) of an AI Framework (AIF), especially designed for execution of AI-based implementations, but also suitable for mixed AI and traditional data processing workflows.

Objective(s)


ProceduralUploaded on Jul 1, 2024
MPAI-CAE V1.4 is a collection of four use cases to improve the user audio experience in a variety of situations.

Objective(s)


ProceduralUploaded on Jul 1, 2024
This standard adopts MPAI Technical Specification Version 1.2 as an IEEE Standard.

Objective(s)


ProceduralUploaded on Jul 1, 2024
This Recommendation specifies an architectural framework for network automation based on artificial intelligence (AI) for resource and fault management in future networks, including international mobile telecommunications-2020 (IMT-2020).

Objective(s)


ProceduralUploaded on Jul 1, 2024
Recommendation ITU-T Y.3172 specifies an architectural framework for machine learning (ML) in future networks including IMT-2020.

Objective(s)


ProceduralUploaded on Jul 1, 2024
This Recommendation provides an architectural framework for machine learning (ML) models serving in future networks including IMT-2020, i.

Objective(s)


ProceduralUploaded on Jul 2, 2024
This DIN SPEC in accordance with the PAS procedure has been drawn up by a DIN SPEC (PAS)-consortium set up on a temporary basis.

Objective(s)


ProceduralUploaded on Jul 2, 2024
The purpose of AI-ESTATE is to standardize interfaces for functional elements of an intelligent diagnostic reasoner and representations of diagnostic knowledge and data for use by such diagnostic reasoners

Objective(s)


ProceduralUploaded on Jul 2, 2024
Recommendation ITU-T Y.3654 specifies the mechanisms of machine learning in big data driven networking (bDDN).

Objective(s)


ProceduralUploaded on Jul 2, 2024
Evaluating deep learning software frameworks to help manufactures take full advantage of certain frameworks and avoid the disadvantages of others.

Objective(s)


ProceduralUploaded on Jul 2, 2024
This Recommendation describes the functional entities and architecture for emotion enabled multimodal user interface based on artificial neural networks.

Objective(s)


Uploaded on Jul 2, 2024
The purpose of this work item is to specify quantitative evaluation criteria of network autonomicity categories, which is defined in the published GR ENI 007.

Objective(s)


ProceduralUploaded on Jul 2, 2024
The purpose of the present document is to provide information on prominent control loop architectures that can be used in modular system design.

Objective(s)


ProceduralUploaded on Jul 2, 2024
Recommendation ITU-T Y.3173 specifies a framework for evaluating the intelligence of future networks including IMT-2020 and a method for evaluating the intelligence levels of future networks including IMT-2020 is introduced.

Objective(s)


ProceduralUploaded on Jul 2, 2024
This Recommendation provides a framework for artificial intelligence (AI) enhanced telecom operation and management (AITOM).

Objective(s)


ProceduralUploaded on Jul 2, 2024
This Recommendation specifies a functional architecture of quality of service (QoS) assurance based on machine learning (ML) for the international mobile telecommunications-2020 (IMT-2020) network.

Objective(s)


catalogue Logos

Disclaimer: The tools and metrics featured herein are solely those of the originating authors and are not vetted or endorsed by the OECD or its member countries. The Organisation cannot be held responsible for possible issues resulting from the posting of links to third parties' tools and metrics on this catalogue. More on the methodology can be found at https://oecd.ai/catalogue/faq.