Ethics Report: Mistral
Rubric: Organisation v4 · Reviewed 3/22/2026
Minimal ethical infrastructure
Safety & Harm Reduction · 6/25
Dedicated safety / responsible-use policy
Publishes a dedicated safety/responsible-use policy that is publicly accessible.
Evidence
Mistral AI documents safety practices and a moderation API framework, but lacks a dedicated responsible-use policy page. The company provides system-level guardrails and moderation tools (mistral-moderation-2411, mistral-moderation-2603) for developers, but these are presented within product documentation rather than as a comprehensive, enforceable safety policy that clearly articulates specific prohibited uses. The company's philosophy emphasizes developer-side responsibility over model-level restrictions, an approach that falls short of a full dedicated policy.
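To make the guardrail tooling concrete, here is a minimal sketch of the opt-in, system-level guardrailing Mistral documents for its chat-completions endpoint. It assumes the public /v1/chat/completions route and the documented safe_prompt flag; the model name is an illustrative choice, and the snippet is a sketch, not a recommended integration.

```python
import os
import requests

# Minimal sketch: opt-in guardrailing on Mistral's chat-completions endpoint.
# safe_prompt asks the API to prepend its standard safety system prompt; it
# defaults to false, consistent with the developer-side-responsibility
# philosophy described above.
resp = requests.post(
    "https://api.mistral.ai/v1/chat/completions",
    headers={"Authorization": f"Bearer {os.environ['MISTRAL_API_KEY']}"},
    json={
        "model": "mistral-small-latest",  # illustrative model name
        "messages": [{"role": "user", "content": "Hello"}],
        "safe_prompt": True,              # opt-in guardrail, off by default
    },
    timeout=30,
)
resp.raise_for_status()
print(resp.json()["choices"][0]["message"]["content"])
```

Because the flag is opt-in, an endpoint deployed without it carries no model-side guardrail at all, which is the crux of the criticism above.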
Public bug-bounty or red-team program
Operates or funds a public bug-bounty / red-team program. Internal-only programs score 0.
Evidence
No public bug-bounty or red-team program is documented in the article or research sources. While Mistral provides moderation tools and has engaged with safety evaluations, there is no evidence of a publicly announced bug-bounty program, red-team initiative, or HackerOne listing. The lack of any public program documentation across available sources indicates this criterion is not met.
Published safety evaluation within last 24 months
Published safety evaluation/audit/model card within last 24 months with quantitative benchmarks on harmful outputs (bias, toxicity, hallucination, etc.).
Evidence
Mistral AI has released technical documentation regarding its models (e.g., Mistral 7B architecture with sliding window attention, Mixtral 8x7B with mixture-of-experts), but published safety evaluations are limited. The article notes that independent testing revealed safety concerns (models generating instructions for illegal activities, weapons manufacturing) documented by the OECD AI Incidents Monitor, but Mistral has not published comprehensive quantitative safety benchmarks across multiple harm categories as part of its technical reports. Third-party audits are not evident.
Documented content-filtering / guardrails
Documents content-filtering/guardrails on production endpoints with user-facing documentation.
Evidence
Mistral AI documents content-filtering and guardrail mechanisms. The company provides a Moderation API with specialized LLM classifiers covering sexual content, hate and discrimination, violence, dangerous/criminal activities, and PII detection across multiple languages. However, the documentation focuses on the availability of moderation tools rather than on what is filtered, why specific rules exist, or how users can report false positives; it is present but not comprehensively detailed.
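As an illustration of that moderation tooling, the following sketch screens a piece of text through the Moderation API and prints any flagged categories. It assumes the /v1/moderations REST route and a results payload carrying categories (boolean flags) and category_scores (floats); field names should be verified against Mistral's current API reference.

```python
import os
import requests

# Minimal sketch of Mistral's Moderation API. Assumed response shape:
# results[0]["categories"]      -> per-category boolean flags
# results[0]["category_scores"] -> per-category float scores
resp = requests.post(
    "https://api.mistral.ai/v1/moderations",
    headers={"Authorization": f"Bearer {os.environ['MISTRAL_API_KEY']}"},
    json={
        "model": "mistral-moderation-latest",
        "input": ["Text to screen before passing it to a model."],
    },
    timeout=30,
)
resp.raise_for_status()
result = resp.json()["results"][0]
flagged = [name for name, hit in result["categories"].items() if hit]
print("flagged:", flagged or "none")
```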
Documented incident-response process
Documented incident-response process for safety failures with a reporting mechanism. Generic "contact us" alone = 0.
Evidence
No documented incident-response process with timelines or SLAs is evident from the article or research sources. While Mistral presumably has internal safety processes and may offer a general 'contact us' channel, there is no public documentation of a dedicated reporting mechanism, response timelines, or SLA-backed incident handling, so this criterion is not met.
Transparency & Trust · 7/25
Training data provenance disclosure
Publishes training data provenance disclosures identifying sources/types/datasets. "Publicly available data" alone = 0.
Evidence
Mistral AI discloses general categories of training data but not specific datasets or detailed composition. The article and research sources indicate that Mistral's models are trained on 'publicly available data' and that the company has emphasized open-weight releases, but specific dataset names, sources, or meaningful composition/curation details are not disclosed. No filtering or exclusion criteria for training data are documented, placing Mistral at the 'general categories' tier rather than specific datasets or full disclosure with filtering.
Meaningful technical documentation for flagship model(s)
Publishes meaningful technical documentation (system card, tech report, research paper) for flagship model(s) regardless of whether weights are released.
Evidence
Mistral AI provides substantive technical documentation for its flagship models. The article details the architecture of Mistral 7B (including grouped-query attention and sliding window attention mechanisms), Mixtral 8x7B (sparse mixture-of-experts with detailed parameter utilization), and Mistral Large (reasoning capabilities, context window specifications). Technical reports and model cards covering architecture, scale, training approach, and design choices are documented, meeting the criteria for substantive technical documentation.
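To ground the two mechanisms named above, here is a toy numpy sketch (illustrative only, not Mistral's implementation): a causal sliding-window attention mask of the kind described for Mistral 7B, and the top-2-of-8 expert routing described for Mixtral 8x7B.

```python
import numpy as np

def sliding_window_causal_mask(seq_len: int, window: int) -> np.ndarray:
    """True where query i may attend to key j: causal (j <= i) and
    within the sliding window (i - j < window)."""
    i = np.arange(seq_len)[:, None]
    j = np.arange(seq_len)[None, :]
    return (j <= i) & (i - j < window)

def top2_moe_weights(router_logits: np.ndarray) -> np.ndarray:
    """Per-token mixture weights: softmax over each token's two
    highest-scoring experts, zero elsewhere. Shape: (tokens, experts)."""
    top2 = np.argsort(router_logits, axis=-1)[:, -2:]           # best two experts
    picked = np.take_along_axis(router_logits, top2, axis=-1)
    picked = np.exp(picked - picked.max(axis=-1, keepdims=True))
    picked /= picked.sum(axis=-1, keepdims=True)                # softmax over the pair
    weights = np.zeros_like(router_logits)
    np.put_along_axis(weights, top2, picked, axis=-1)
    return weights

mask = sliding_window_causal_mask(seq_len=8, window=4)  # toy sizes; Mistral 7B documents a 4096-token window
w = top2_moe_weights(np.random.randn(5, 8))             # Mixtral routes each token to 2 of 8 experts
assert np.allclose(w.sum(axis=-1), 1.0)
```

Only the two selected experts execute per token, which is why Mixtral's active parameter count per token is a fraction of its total parameter count.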
Transparency report (takedowns, government requests, etc.)
Publishes a transparency report covering takedowns, government requests, enforcement stats, and/or safety incidents.
Evidence
No transparency report covering takedowns, government requests, or similar disclosures is evident from the article or research sources. While Mistral has engaged with European AI policy debates and regulatory participation regarding the EU AI Act, it has not published a transparency report comparable to those of OpenAI, Google, or other major AI companies. No report addressing government requests, content takedowns, or similar transparency metrics is documented.
ToS training data use disclosure with opt-out
ToS explicitly states whether user inputs/outputs are used for training, with opt-out mechanism if applicable.
Evidence
The article does not disclose explicit ToS language regarding training data use or opt-out mechanisms. While Mistral emphasizes open-weight models and developer control, there is no evidence of explicit statements in Terms of Service about how user data may be used for training, nor is there documentation of an opt-out mechanism or explicit never-use commitment. This represents vague or absent language on the issue.
Creator/artist content provenance disclosure
Discloses training data provenance specifically for creator/artist content (copyrighted or artist-created works).
Evidence
No specific disclosure regarding creator or artist content provenance is documented. The article does not address whether Mistral's training data includes creative or copyrighted works, nor does it disclose specific content types, sources, or licensing arrangements for creative works. The absence of any discussion of artist/creator content in the available documentation indicates this criterion is not met.
Human & Creator Impact · 5/25
Artist/creator opt-out or removal mechanism
Documented artist/creator opt-out or removal mechanism. "We respect copyright" alone = 0.
Evidence
No artist/creator opt-out or removal mechanism is documented in the article or research sources. While Mistral's open-weight philosophy emphasizes transparency, there is no evidence of a dedicated opt-out form or process, an email-based removal mechanism, a Spawning integration, or any other documented removal machinery for creators whose work may be in training data. No published evidence of honoring removal requests exists.
Public licensing or revenue-sharing with creators
Public licensing agreements or revenue-sharing partnerships with creators/publishers/media organizations.
Evidence
No public licensing or revenue-sharing partnerships with creators are documented. The article mentions strategic partnerships with Accenture and Reply for enterprise solutions, and a collaboration with the Austrian Academy of Sciences for Greek language model development, but none of these represent creator licensing deals or revenue-sharing arrangements. No publicly announced licensing deals with named creative entities are evident.
Provenance/attribution tooling for AI outputs
Provenance/attribution tooling for AI-generated outputs (C2PA, watermarking, metadata tagging).
Evidence
No provenance/attribution tooling or commitment is documented for Mistral AI outputs. The article does not mention implementation of C2PA metadata, watermarking, SynthID, or other provenance standards. No public commitment to a specific provenance standard for AI-generated outputs is evident from available sources.
Workforce impact assessment or commitment
Published workforce impact assessment or commitment (labor market effects, reskilling, human-in-the-loop programs).
Evidence
No workforce impact assessment or commitment is documented. The article discusses Mistral's lean corporate structure and emphasis on researchers and engineers, and mentions that the French government introduced mandatory AI training for students, but there is no evidence of Mistral publishing a workforce impact assessment, naming a specific initiative to address AI's labor impact, or documenting measurable outcomes related to workforce transition support.
Does NOT claim ownership over user-generated outputs
ToS does NOT claim copyright/exclusive ownership over user-generated outputs. Silent ToS = 0.
Evidence
The article and research sources do not disclose explicit output-ownership language in Mistral's ToS. Given Mistral's emphasis on user control, customization, and decentralization (reflected in its open-weight models and positioning around user autonomy), and the absence of any documented claim of ownership over user-generated outputs, it is reasonable to infer that Mistral grants users full ownership or unrestricted rights. The company's philosophy of putting control in users' hands supports this interpretation, though the inference remains unconfirmed without explicit ToS language.
Governance · 9/25
Discloses corporate structure, investors, and board
Publicly discloses corporate structure, major investors, and board composition.
Evidence
Mistral AI's investors are publicly disclosed. The article documents that the company's funding rounds were led by Lightspeed Venture Partners and Andreessen Horowitz (a16z), with participation from Nvidia and Salesforce. However, board composition is not explicitly disclosed in the article or research sources. With only investors clearly documented, this criterion sits at the 'one disclosed' tier rather than the tier requiring both investors and board.
Independent ethics/safety advisory board
Independent ethics/safety advisory board with verifiably external members. Internal trust & safety team alone = 0.
Evidence
No independent ethics or safety advisory board is documented for Mistral AI. The article describes the company's safety philosophy as emphasizing developer-side responsibility and mentions internal leadership (Arthur Mensch as CEO, Timothée Lacroix and Guillaume Lample overseeing technical operations), but there is no evidence of an external ethics/safety advisory body, independent members, or published recommendations/reports from such a board. An internal trust & safety team alone does not satisfy this criterion.
Legal corporate structure preserving safety/mission
Corporate structure preserves safety/mission mandate via a legal mechanism (PBC, capped-profit, charter clause).
Evidence
Mistral AI is incorporated as a simplified joint-stock company (société par actions simplifiée) under French law, which is a standard corporate structure with no special legal provisions for preserving safety or mission. The article does not document any special legal mechanism such as Benefit Corporation status, capped-profit structure, or other verifiable legal safeguards. While the company has stated a mission to democratize AI, there is no verifiable legal mechanism in corporate filings documented.
Public policy engagement or lobbying disclosure
Public policy engagement or lobbying disclosure: positions on AI regulation, lobbying spend, governance framework signatory.
Evidence
Mistral AI has engaged publicly on policy regarding the European Union AI Act. The article documents that the company lobbied the EU for regulatory concessions on foundation models, advocating a 'product safety' approach and arguing against stringent obligations on model developers. However, full disclosure of lobbying activities (spending, specific officials lobbied, or governance-framework participation) is not evident. The company has published policy positions but has not provided full transparency on its lobbying efforts.
No senior departures citing safety/ethics (last 36 months)
No publicly documented senior leadership (VP+) departures or whistleblower events citing safety/ethics concerns in the last 36 months. Only on-record statements count.
Evidence
No senior departures citing safety or ethics concerns within the last 36 months are documented in the article or research sources. The article describes the three co-founders (Arthur Mensch, Timothée Lacroix, Guillaume Lample) as remaining central to the company's leadership. While the company has faced criticism for its safety approach and received backlash over the 2024 Microsoft partnership, there is no documented case of a senior VP+ departure on record publicly citing safety or ethics concerns as the reason.
Scores are generated using the Amallo Ethics Rubric (Organisation v4) based on publicly verifiable information. Each criterion is scored against defined tiers — only exact tier values are valid. Evidence is sourced from official documentation, research papers, and independent analyses. Scores may change as new information becomes available.
