Database CI/CD and Schema Migration with ClickHouse and GitHub

Estimated: 30 mins
Database CI/CD and Schema Migration with ClickHouse and GitHub

A series of articles about Database CI/CD and Schema Migration with ClickHouse


In the last article Database CI/CD and Schema Migration with ClickHouse, you have tried UI workflow in Bytebase.

This tutorial will bring you to the next level by introducing the GitOps workflow, where you commit the schema change script to the GitHub repository, which will in turn trigger the schema deployment pipeline in Bytebase.

You can use Bytebase free version to finish the tutorial.

Features included

  • GitOps Workflow
  • Change History

Prerequisites

Before you start this tutorial, make sure:

Step 1 - Run Bytebase in Docker and set the External URL generated by ngrok

ngrok is a reverse proxy tunnel, and in our case, we need it for a public network address in order to receive webhooks from VCS. ngrok we used here is for demonstration purposes. For production use, we recommend using Caddy.

ngrok-reverse-proxy

  1. Login to ngrok Dashboard and follow its Getting Started steps to install and configure.

  2. Run ngrok:

    ngrok http 8080

    and obtain the public URL https://b725-103-197-71-76.ap.ngrok.io: terminal-ngrok

  3. Run Bytebase in Docker with the following command:

    docker run --rm --init \
      --name bytebase \
      --publish 8080:8080 \
      --volume ~/.bytebase/data:/var/opt/bytebase \
      bytebase/bytebase:2.14.1
  4. Bytebase is running successfully in Docker, and you can visit it via localhost:8080. Register an admin account and it will be granted the workspace admin role automatically.

  5. Click the gear icon (Settings) on the top right. Click General under Workspace. Paste https://b725-103-197-71-76.ap.ngrok.io as External URL under Network section and click Update.

    external-url

  6. Bytebase is running successfully in Docker, and you can visit it via https://b725-103-197-71-76.ap.ngrok.io.

Step 2 - Find your ClickHouse instance in Bytebase

  1. Visit Bytebase Console through the browser via your ngrok URL. Log in using your account created from the previous tutorial. bb-login

  2. If you have followed the last article, you should have a project TestClickHouse and a database db_demo. bb-home

Step 3 - Connect Bytebase with GitHub.com

  1. Click Settings on the top bar, and then click Workspace > GitOps. Choose GitHub.com and click Next. bb-gitops-github-step1

  2. Follow the instructions within STEP 2, and in this tutorial, we will use a personal account instead of an organization account. The configuration is similar.

  3. Go to your GitHub account. Click Settings on the dropdown menu. gh-settings-dropdown

  4. Click Developer Settings at the bottom of the left side bar. Click OAuth Apps, and click New OAuth App. gh-oauth-apps

  5. Fill Application name and then copy the Homepage and Authorization callback URL in Bytebase and fill them. Click Register application. gh-register-oauth

  6. After the OAuth application is created successfully. Click Generate a new client secret. Copy Client ID and this newly generated client secret and paste them back in Bytebase. gh-copy-client-id bb-gitops-github-step2 gh-auth

  7. Click Next. You will be redirected to the confirmation page. Click Confirm and add, and the Git provider is successfully added. bb-gitops-github-step3

Step 4 - Enable GitOps workflow with ClickHouse

  1. Go to project TestClickHouse, click GitOps, and choose GitOps Workflow. Click Configure GitOps. bb-project-gitops-gitops-workflow

  2. Choose GitHub.com - the provider you just added. It will display all the repositories you can manipulate. Choose clickhouse-test-bb-local. bb-project-gitops-github bb-project-gitops-github-repo

  3. Keep the default setting, and click Finish.

Step 5 - Change schema for ClickHouse by pushing SQL schema change files to GitHub

  1. In your GitHub repository clickhouse-test-bb-local, create a folder bytebase, then create a subfolder Test, and create an sql file following the pattern {{ENV_ID}}/{{DB_NAME}}##{{VERSION}}##{{TYPE}}##{{DESCRIPTION}}.sql. It is the default configuration for file path template setting under project GitOps.

    test/db_demo##202303020000##ddl##create_t2.sql

    • test corresponds to {{ENV_ID}}
    • db_demo corresponds to {{DB_NAME}}
    • 202303020000 corresponds to {{VERSION}}
    • ddl corresponds to {{TYPE}}
    • create_t2 corresponds to {{DESCRIPTION}}

    Paste the sql script in it.

CREATE TABLE
 t2 (id UInt64, name String) ENGINE = MergeTree
ORDER BY id;

vsc-test-sql

  1. Commit and push this file.

  2. Go to Bytebase, and go into project TestClickHouse. You’ll find there is a new Push Event and a new issue 106 created. bb-push-notification-only

  3. Click issue/106 and go the issue page. Click Resolve issue, and the issue will be Done. You’ll see

    • The issue is created via GitHub.com
    • The issue is executed without approval because it’s on Test environment where manual approval is skipped by default. The Assignee is Bytebase, because the execution is automatic, and requires no manual approval.
    • The SQL is exactly the one we have committed to the GitHub repository.
    • The Creator is A, because the GitHub user you use to commit the change has the same email address found in the Bytebase member list.

bb-issue-create-t2-done

  1. Click View change, and you could view the schema diff. bb-db-change-diff

  2. Go to GitHub repository, and you will see besides your committed sql, there is a .db_demo##LATEST.sql file. Because you have configured [Schema path template](/docs/vcs-integration/name-and-organize-schema-files#schema-path-template) before, Bytebase will write back the latest schema to that specified path after completing the schema change. Thus you have access to an update-to-date full schema at any time. gh-LATEST

Summary and Next

Now you have tried out GitOps workflow, which will store your ClickHouse schema in GitHub and trigger the change upon committing the change to the repository, to bring your ClickHouse change workflow to the next level of Database DevOps - Database as Code.

In real world scenario, you might have separate features and main branches corresponding to your dev and production environment, you can check out GitOps with Feature Branch Workflow to learn the setup. Have a try and look forward to your feedback!

Edit this page on GitHub

Subscribe to Newsletter

By subscribing, you agree with Bytebase's Terms of Service and Privacy Policy.