New to KubeDB? Please start here.

MariaDB Database Migration

This guide will show you how to use KubeDB Migrator to migrate an existing MariaDB database — such as one running on AWS RDS or any external instance — entirely into a KubeDB-managed MariaDB with minimal downtime.

Before You Begin

First, you need a Kubernetes cluster, and the kubectl command-line tool must be configured to communicate with it.
Install KubeDB operator with the Migrator operator enabled in your cluster following the steps here.
The source MariaDB instance must be network-reachable from within your Kubernetes cluster.
The source MariaDB instance must have binary logging enabled with binlog_format=ROW and binlog_row_image=FULL. The database user provided for migration must have replication privileges.
You should be familiar with the KubeDB concepts used in this guide, such as the MariaDB and AppBinding resources.

To keep everything isolated, we are going to use a separate namespace called demo throughout this tutorial.

$ kubectl create ns demo
namespace/demo created
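
The binary-log prerequisites listed above can be checked before any migration resources are created. The sketch below is one way to do that, assuming the `mariadb` command-line client is installed where you run it (older installations ship it as `mysql`). The function names `query_binlog_vars` and `validate_binlog_vars` are hypothetical helpers, not part of KubeDB.

```shell
# Hypothetical helpers: compare the source's binary-log settings with the prerequisites.

# Print "name<TAB>value" for the relevant server variables.
# $1 = host, $2 = port, $3 = user (the client prompts for the password).
query_binlog_vars() {
  mariadb -h "$1" -P "$2" -u "$3" -p -N -B -e \
    "SHOW GLOBAL VARIABLES WHERE Variable_name IN ('log_bin','binlog_format','binlog_row_image')"
}

# Read "name value" lines on stdin, report every mismatch, and
# return non-zero if a value is wrong or a variable is missing.
validate_binlog_vars() {
  status=0
  seen=0
  while read -r name value; do
    case "$name" in
      log_bin)          expected=ON ;;
      binlog_format)    expected=ROW ;;
      binlog_row_image) expected=FULL ;;
      *)                continue ;;
    esac
    seen=$((seen + 1))
    if [ "$value" != "$expected" ]; then
      echo "$name is $value, expected $expected"
      status=1
    fi
  done
  [ "$seen" -eq 3 ] || { echo "expected 3 variables, found $seen"; status=1; }
  return "$status"
}
```

A possible invocation is `query_binlog_vars <host> <port> <username> | validate_binlog_vars && echo "binlog settings OK"`. The replication privileges of the migration user are not covered by this check.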

Prepare Source Connection Information

First, create an authentication secret to communicate with the source MariaDB database:

$ kubectl create secret generic source-mariadb-auth -n demo \
                --type=kubernetes.io/basic-auth \
                --from-literal=username=<username> \
                --from-literal=password=<password>

Now create an AppBinding with the necessary information. The Migrator operator reads the source MariaDB connection information from this AppBinding CR. Use the following YAML to create your AppBinding:

apiVersion: appcatalog.appscode.com/v1alpha1
kind: AppBinding
metadata:
  name: source-mariadb
  namespace: demo
spec:
  type: mariadb
  version: "10.5.23"
  clientConfig:
    url: "mariadb://host:port"
  secret:
    name: source-mariadb-auth

Here,

spec.clientConfig.url is the connection URL of the source MariaDB instance.
spec.secret.name is the reference to the secret we created earlier, containing the MariaDB authentication information.

KubeDB creates an AppBinding by default for every database it manages, so there is no need to create one for the target database.

Create Target MariaDB Database

KubeDB implements a MariaDB CRD to define the specification of a MariaDB database. Use the following MariaDB object to create the target database.

apiVersion: kubedb.com/v1
kind: MariaDB
metadata:
  name: target-mariadb
  namespace: demo
spec:
  version: "10.5.23"
  storageType: Durable
  storage:
    storageClassName: "local-path"
    accessModes:
      - ReadWriteOnce
    resources:
      requests:
        storage: 20Gi
  deletionPolicy: WipeOut

$ kubectl apply -f target-mariadb.yaml
mariadb.kubedb.com/target-mariadb created

Note: Adjust spec.storage.resources.requests.storage to fit the size of the source database.

Wait until target-mariadb has status Ready.
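
Waiting for readiness can be scripted instead of checked by hand. This is a minimal sketch, assuming kubectl v1.23 or newer (for `--for=jsonpath`) and that the MariaDB object reports its state in `.status.phase`. The name `wait_for_mariadb_ready` is a hypothetical helper.

```shell
# Hypothetical helper: block until a KubeDB MariaDB object reports phase Ready.
# $1 = object name, $2 = namespace, $3 = timeout (optional, defaults to 10m)
wait_for_mariadb_ready() {
  kubectl wait "mariadb/$1" -n "$2" \
    --for=jsonpath='{.status.phase}'=Ready \
    --timeout="${3:-10m}"
}
```

For the database in this guide, the call would be `wait_for_mariadb_ready target-mariadb demo`. The command exits non-zero if the timeout expires first.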

Apply Migrator CR

To migrate the database we have to create a Migrator CR. Below is the YAML of the Migrator CR that we are going to create:

apiVersion: migrator.kubedb.com/v1alpha1
kind: Migrator
metadata:
  name: mariadb-migrate
  namespace: demo
spec:
  jobTemplate:
    spec:
      securityContext:
        fsGroup: 65534
  source:
    mariadb:
      connectionInfo:
        appBinding:
          name: source-mariadb
          namespace: demo
        dbName: "mysql"
        maxConnections: 100
      schema:
        enabled: true
        database: [] # databases to include; empty means all databases
        excludeDatabase: [] # databases to exclude from migration
      snapshot:
        enabled: true
        pipeline:
          workers: 3
          sinkers: 4
          buffer: 12
          write_batch_size: 200
          read_batch_size: 1000
      streaming:
        enabled: true

  target:
    mariadb:
      connectionInfo:
        appBinding:
          name: target-mariadb
          namespace: demo
        dbName: "mysql"
        maxConnections: 100

Here,

spec.source / spec.target — connectionInfo:

appBinding.name / appBinding.namespace — references the AppBinding for the source or target MariaDB instance.
dbName — the internal database used as the initial connection entry point.
maxConnections — limits the number of concurrent connections the migrator opens to this MariaDB instance.

spec.source.schema — schema migration phase:

enabled: true — enables the schema migration phase.
database — list of databases to include; empty means all databases.
excludeDatabase — list of databases to exclude from migration.

spec.source.snapshot — bulk snapshot phase:

enabled: true — enables the initial bulk snapshot phase.
pipeline.workers — number of parallel workers, each processing a separate table concurrently.
pipeline.sinkers — number of parallel write workers pushing data to the target for each worker.
pipeline.buffer — size of the in-memory queue (in records) between readers and writers.
pipeline.read_batch_size — number of rows fetched per read batch from the source.
pipeline.write_batch_size — number of rows written per batch to the target.

spec.source.streaming — CDC streaming phase:

enabled: true — enables change-data capture streaming after the snapshot completes, keeping the target continuously in sync with ongoing changes on the source.
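
As a quick sanity check on the pipeline settings, the write concurrency of the snapshot phase follows from the descriptions above: each worker has its own set of sinkers, so the total number of parallel writers is workers multiplied by sinkers. The sketch below computes that figure and compares it with the target's maxConnections. It assumes, which the field descriptions do not state, that each sinker holds at most one connection to the target. Both function names are hypothetical.

```shell
# Total parallel writers in the snapshot phase: workers x sinkers.
# $1 = pipeline.workers, $2 = pipeline.sinkers
snapshot_writers() {
  echo $(( $1 * $2 ))
}

# Succeed if the writers fit within maxConnections.
# Assumption: one target connection per sinker.
# $1 = workers, $2 = sinkers, $3 = maxConnections
writers_fit() {
  [ "$(snapshot_writers "$1" "$2")" -le "$3" ]
}

snapshot_writers 3 4                                  # prints 12 for the example Migrator
writers_fit 3 4 100 && echo "within maxConnections"   # prints "within maxConnections"
```

With the example values (3 workers, 4 sinkers, maxConnections of 100), the 12 writers leave ample headroom under this assumption.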

Watch Migration Progress

Wait for the LAG to drop to near zero. Run the following command to watch the Migrator CR:

$ watch kubectl get migrator -n demo
Every 2.0s: kubectl get migrator -n demo

NAME              PHASE     DBTYPE    STAGE       LAG   PROGRESS   AGE
mariadb-migrate   Running   mariadb   Streaming   0B               4h36m

Cutover

Once the LAG drops to near zero, stop all writes to the source database. Wait until the LAG reaches exactly zero — at that point both databases are fully in sync.
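
The wait for zero lag can also be scripted. The sketch below reads the LAG column from the `kubectl get migrator` output, assuming the column layout shown in the sample output (LAG is the fifth column) and that zero lag is displayed as `0B`. The helper names `migrator_lag` and `wait_for_zero_lag` are hypothetical.

```shell
# Hypothetical helper: print the LAG column for one Migrator.
# Assumes the columns NAME PHASE DBTYPE STAGE LAG ..., so LAG is field 5.
# $1 = Migrator name, $2 = namespace
migrator_lag() {
  kubectl get migrator -n "$2" "$1" --no-headers | awk '{print $5}'
}

# Block until LAG reads "0B", checking every 5 seconds.
# Assumes zero lag is displayed as "0B", as in the sample output.
wait_for_zero_lag() {
  until [ "$(migrator_lag "$1" "$2")" = "0B" ]; do
    sleep 5
  done
}
```

After writes to the source have been stopped, `wait_for_zero_lag mariadb-migrate demo && echo "in sync"` returns once the reported lag is zero.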

Now delete the Migrator CR to stop the migration process:

$ kubectl delete migrator -n demo mariadb-migrate
migrator.migrator.kubedb.com "mariadb-migrate" deleted

Finally, update your application’s connection string to point to the target KubeDB-managed MariaDB database. The migration is complete.
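
To build the new connection string, the application needs the target's credentials. The sketch below decodes a field from a Kubernetes Secret. It assumes that KubeDB stored the target's credentials in a Secret named `target-mariadb-auth` with `username` and `password` keys; confirm the name against your cluster before relying on it. The helper name `secret_field` is hypothetical.

```shell
# Hypothetical helper: decode one field of a Kubernetes Secret.
# $1 = Secret name, $2 = namespace, $3 = field (for example: username, password)
secret_field() {
  kubectl get secret -n "$2" "$1" -o jsonpath="{.data.$3}" | base64 --decode
}
```

For example, `secret_field target-mariadb-auth demo username` and `secret_field target-mariadb-auth demo password` would print the two values under the assumption above.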
