This is an image for use in a Lagoon advanced task to create a database container image from a live database, possibly sanitising the data on the way.
Currently it only supports mysql/mariadb, but could eventually be extended to support other database systems. It was originally designed for mariadb, hence the naming of some of the files in this project.
There are some requirements and other things that need to be called out for its usage. Some gotchas as well with variable names that may prevent multiple versions working properly that may need revisting.
The mtk config file should be provided as a base64 encoded string, this allows for easier adding to the api to retain data
$ cat example.mtk.yml | base64 | tr -d '\n\r'
cmV3cml0ZToKICAjIERydXBhbCA4CiAgdXNlcnNfZmllbGRfZGF0YToKICAgIG1haWw6IGNvbmNhdCh1aWQsICJAU0FOSVRJU0VEIikKICAgIHBhc3M6ICciU0FOSVRJWkVEX1BBU1NXT1JEIicKICAgICMgV2UgZG9uJ3QgYWxsb3cgbm8gZGlydHkgbmFtZXMgYXJvdW5kIGhlcmUhCiAgICBuYW1lOiBjb25jYXQodWlkLCAiLVNBTklUSVNFRCIpCiAgICBpbml0OiAnIlNBTklUSVNFRF9JTklUIicKICAjIERydXBhbCA3CiAgdXNlcnM6CiAgICBtYWlsOiBjb25jYXQodWlkLCAiQFNBTklUSVNFRCIpCiAgICBwYXNzOiAnIlNBTklUSVpFRF9QQVNTV09SRCInCgp3aGVyZToKICAjIE9ubHkgaW5jbHVkZSBib2R5IGZpZWxkIGRhdGEgZm9yIGN1cnJlbnQgcmV2aXNpb25zLgogIG5vZGVfcmV2aXNpb25fX2JvZHk6IHwtCiAgICAgIHJldmlzaW9uX2lkIElOIChTRUxFQ1QgdmlkIEZST00gbm9kZSkKCm5vZGF0YToKICAtIGNhY2hlKgogIC0gY2FwdGNoYV9zZXNzaW9ucwogIC0gaGlzdG9yeQogIC0gZmxvb2QKICAtIGJhdGNoCiAgLSBxdWV1ZQogIC0gc2Vzc2lvbnMKICAtIHNlbWFwaG9yZQogIC0gc2VhcmNoX2FwaV90YXNrCiAgLSBzZWFyY2hfZGF0YXNldAogIC0gc2VhcmNoX2luZGV4CiAgLSBzZWFyY2hfdG90YWwKICAtIHdhdGNoZG9nCiAgLSB3ZWJmb3JtX3N1Ym1pc3Npb25fZGF0YQoKaWdub3JlOgogIC0gX19BQ1FVSUFfTU9OSVRPUklOR19fCgo=
An example can be found in example.mtk.yml
An example of the GraphQL needed to create an advanced task is available in createDumpSanitisedDB.gql. The variables which can be passed in are also documented in this file (in the "displayName" field of each entry).
The GraphQL examples are:
- createDumpSanitisedDB.gql: Allows the person running the task to change all the (normal) variables
- createDumpSanitisedDB_noArgs.gql: Variables must be set on the environment; the person running the task has no control
- createDumpSanitisedDB_setDBVariables.gql: Allows the person running the task to additionally choose the database to which they connect
Most of the variables are explained in the example GraphQL files, but one in particular requires a better writeup.
This is the name of the resulting image to build, without the tag (eg, for
dockerhub myproject/image, or for custom registry quay.io/myproject/image, can
also be a lpattern like
${registry}/${organization}/${project}/${environment}/${service}-data
)",
The default value is ${registry}/${organization}/${project}/${environment}/${service}-data
The variables available are as follows:
${registry}
:BUILDER_REGISTRY_HOST
(variable defined in Lagoon organisation/project/environment)${organization}
:BUILDER_REGISTRY_ORGANIZATION
(variable defined in Lagoon organisation/project/environment)${project}
: The name of the Lagoon project${environment}
: The name of the Lagoon environment${service}
: The name of the Lagoon service in the docker-compose file${database}
: The name of the database.${database}
is munged so that:- Any special character not allowed in DockerHub repo names is removed (replaced with nothing), and
- If there are two special characters in a row, the first is retained, and later ones are removed (also as per DockerHub repo name requirements)
There are functionally three images we have to worry about:
- The Sanitiser Image: This is the image that dumps the sanitised database and builds the other images
- The Sanitised Builder Image: This builds the sanitised image
- The Sanitised Clean Image: This is the clean MariaDB image that the sanitised database is put in
The image is built using the Dockerfile
. Some parts are written in Go.
This uses the Makefile
to build the parts of the image that are written in Go.
This is the variable calculation part of the process, and was rewritten in Go
so that tests could be applied. Other files associated with this part of the
process are:
cmd/main.go
go.mod
go.sum
main.go
internal/builder/builder.go
internal/builder/builder_test.go
: Tests forinternal/builder/builder.go
internal/builder/variables.go
internal/builder/variables_test.go
: Tests forinternal/builder/variables.go
The entry point is image-builder-entry
. This is just a wrapper around
mariadb-image-builder
.
The process goes through the following basic stages:
- Set up all the initial variables
- MTK creates a database dump that's basically a sanitised .sql file
- Make docker-style container with sanitised DB (using podman); this uses a builder image, and copies the results into a clean image
- Save new container to registry
The files used in this live in the builder
directory (and you could arguably
include the files in the internal/builder directory, which I've listed under
"The Go Parts", above).
These are:
builder/mariadb.Dockerfile
: The dockerfile that's the script for both the builder and clean images mentioned in step 3, abovebuilder/import.my.cnf.tpl
: The my.cnf used in the builder image
builder/my.cnf.tpl
: The my.cnf used in the final sanitised image
renovate.json
allows for the easy upgrading of various component pieces of
software, especially mtk itself.