Santander is opening up its technology to help build more trustworthy, competitive artificial intelligence (AI) in banking, on a par with sector leaders. The bank has shared over a dozen of its AI projects under an Open Source licence, enabling developers, researchers and professionals the world over to use this technology free of charge to grow. You can check out what Santander has published on its new channel on GitHub, the global repository of shared initiatives.
Banco Santander remains at the forefront of AI development by offering public repositories on Open Source to foster technical cooperation and shared learning. The 11 projects it has chosen apply directly to banks and other businesses and could bring benefits to several sectors. The aim is to work with the community on developing the AI of the future.
From now on, other developers will be able to put forward proposals to enhance Santander AI Lab's tools, thus optimizing a model of lifelong learning and development based on shared intelligence. According to José Manuel de la Chica, Head of Santander AI Lab, “we’re not doing this because Open Source is the talk of the town. We’re doing it because the true challenges of advanced AI — security, equity, robustness, privacy, governance and traceability — are too important and cross-cutting to address them in silos”.
At Santander, we believe that the next phase of artificial intelligence will not just depend on who has access to the most advanced models, but on who is capable of using them with rigour, confidence and responsibility. In banking, this means being able to prove that systems are secure, fair, robust and auditable. That’s why we decided to open up some of the Santander AI Lab’s work — special tools, without real customer data, which can help the community make headway with some of AI’s toughest challenges. We want to contribute, learn and cooperate from a very clear position: technological ambition, acting responsibly at all times, and with our feet firmly on the ground.
José Manuel de la Chica, Head of Santander AI Lab
This initiative enables us to share tools, examples and resources developed by the bank’s teams in such areas as AI, machine learning, large language models, generative AI, responsible AI, and AI governance. Publishing these projects is part of Santander’s commitment to responsible innovation, cooperation with the technology ecosystem, and exchange of technical knowledge under internal intellectual property, data protection, cybersecurity, licensing and brand review processes. Each repository includes technical documents, an Apache 2.0 Open Source licence, contribution guides, codes of conduct, security information, and review processes.
Santander AI Lab has published the gen-fraud-graph on GitHub. It’s a tool that helps create synthetic networks of fraud-related transactions and behaviours. It addresses one of banking's major AI challenges: learning to uncover complex patterns without compromising people's privacy. The idea is simple: if we want to enhance fraud detection, we need environments where we can test hypotheses, compare models, and understand how certain signs emerge. This project generates artificial data to mirror specific risk patterns. There are no real customer data as “AI innovation must run alongside privacy, not at its expense”.
We have also shared mech-gov-framework, which explores how to build governance mechanisms based on language models, especially when they can influence sensitive or high-impact decisions. It's a key initiative as “it attempts to enable the rigorous review, traceability and governance of AI functioning”. The aim is to turn security, consistency, auditability, control and other principles into more specific and measurable elements. Rather than blindly trusty a system’s response, the project proposes working with rules, thresholds and checks that help decide when to go live, under what conditions, and with which limits.
Another project that Santander AI Lab has opened up is mutatis-mutandis, which addresses the challenge of getting AI systems to act fairly and treat people in comparable situations equally. This initiative looks into algorithmic discrimination to enable analysis with counterfactual comparators. The main idea is to move beyond general statements on fairness to tools that enable us to examine, test and discuss the behaviour of models with greater precision, as “trust in AI is built not only by looking at its results, but by understanding how it gets there and whom it may affect”.
Santander AI Lab initiatives on Open Source
Gen-Fraud-Graph
Project Open Source, licenced under Apache 2.0, aims to create synthetic fraud graphs for research, experimentation and the development of advanced fraud detection capabilities.
Mutatis-mutandis
Open Source project under the Apache-2.0 licence, with a focus on situation testing for discrimination analysis with counterfactual comparators, published as research code for the paper "Mutatis Mutandis: Revisiting the Comparator in Discrimination Testing" to enhance algorithmic fairness and responsible AI.
Ralph
Open Source project under the Apache-2.0 licence that provides a configurable Bash/PowerShell loop that runs an AI coding command-line interface (CLI), starting a fresh agent session on every iteration, to automate and experiment with development agents.
Autoguardrails
Open Source project under the Apache-2.0 licence, aimed at alignment-research scaffold (autoresearch-style) for LLM guardrails: search over a single policy.md surface.
Ralph-vault-skill
Open Source project under the Apache-2.0 licence that provides a skill to generate the knowledge vault for projects using the Ralph loop, aimed at automatic documenting and RAG flows.
Genetic-algorithm
Open Source project under the Apache-2.0 licence that implements a dependency-free Python genetic-algorithm engine with pluggable fitness criteria as a reusable search core for an LLM/AI autoresearcher.
Mech-gov-framework
Open Source project under the Apache-2.0 licence that focuses on mechanical Governance for LLM decisions, with model-agnostic governance regimes (R1/R2/R3), hard gates, entropy commit-reveal, and governance metrics for high-stakes LLM decision systems.
Auto-bayesian
Open Source project under the Apache-2.0 licence that provides config-driven, interpretable Bayesian network training for relational tabular data, with a focus on explicability and responsible AI.
Linear-adapter-trainer
Open Source project under the Apache-2.0 licence to train linear embedding adapters with triplet loss to align retrieval embeddings with user queries and to enhance RAG systems.
LLM_bridge
Open Source project under the Apache-2.0 licence that offers a tiny, vendor-neutral LLM client library, with one interface (LLMClient) with pluggable adapters for OpenAI, AWS Bedrock and Google Gemini, or bring-your-own backends.
Sota-stressed-datasets
Open Source project (code under the Apache-2.0 licence — the datasets have their own data licences) that pens benchmark datasets republished in stressed form to evaluate ML/LLM robustness. Curated by Santander AI Lab.