GitLab CI/CD Migration for Maven Build Pipelines

The transition of build automation from Jenkins to GitLab CI/CD represents a fundamental shift in how software delivery pipelines are architected, moving from a centralized controller model to a distributed, version-controlled configuration approach. In the context of Maven-based Java projects, this migration involves translating Jenkins' declarative pipelines or freestyle jobs into the .gitlab-ci.yml syntax. The primary objective of this transition is to align the build, test, and install lifecycles with GitLab's job-based execution model, ensuring that the artifacts are produced and validated consistently across different environments. This process necessitates a deep understanding of how Maven's internal repository management interacts with the ephemeral nature of GitLab runners, specifically regarding the handling of the .m2 directory and the configuration of the Java Development Kit (JDK) and Maven versions.

Architectural Paradigms of Jenkins and GitLab CI/CD

To understand the migration process, one must first analyze the starting point within Jenkins. Jenkins typically manages Maven builds through three primary methods: freestyle projects with shell execution, the Maven task plugin, or declarative pipelines. In a declarative pipeline, the configuration is often stored in a Jenkinsfile within the Git repository or directly in the Jenkins controller. A typical Jenkins declarative pipeline for Maven utilizes a structure where an agent is defined, tools such as maven-3.6.3 and jdk11 are specified, and a sequence of stages (Build, Test, Install) is executed via shell commands.

In contrast, GitLab CI/CD utilizes a YAML-based configuration file located at the root of the repository. The fundamental unit of execution in GitLab is the "job," and these jobs are grouped into "stages." While Jenkins pipelines often rely on persistent agents where tools are pre-installed, GitLab encourages the use of ephemeral Docker containers. This shift removes the burden of maintaining virtual machines and ensures that every build starts from a clean, known state, thereby eliminating the "it works on my machine" problem that frequently plagues persistent Jenkins nodes.

The Migration Path: From Jenkins Pipeline to .gitlab-ci.yml

The migration from a Jenkins declarative pipeline to GitLab CI/CD involves mapping the stages and steps from the Jenkinsfile to the stages, variables, and script keywords in GitLab. In Jenkins, a pipeline might look like this:

groovy pipeline { agent any tools { maven 'maven-3.6.3' jdk 'jdk11' } stages { stage('Build') { steps { sh "mvn package -DskipTests" } } stage('Test') { steps { sh "mvn test" } } stage('Install') { steps { sh "mvn install -DskipTests" } } } }

The equivalent GitLab CI/CD configuration replaces the tools block with a global image definition and converts the steps into script blocks within individual jobs. This migration ensures that the behavior and syntax of building, testing, and installing remain consistent with the original Jenkins workflow while leveraging GitLab's native orchestration capabilities.

Detailed Analysis of the GitLab CI/CD Configuration

A fully migrated Maven pipeline in GitLab consists of global keywords followed by specific job definitions. The structure is designed to optimize execution speed and resource utilization.

Global Pipeline Keywords

The global keywords establish the environment and the order of operations for the entire pipeline.

stages: This keyword defines the sequence of execution. In this specific Maven example, three stages are defined: build, test, and install. These stages run in the exact order they are listed. If a job in the build stage fails, the pipeline will not proceed to the test or install stages, ensuring that only stable code is tested and deployed.
default: This section allows for the definition of settings that apply to all jobs unless overridden. In this configuration, the image is set to maven:3.6.3-openjdk-11. This ensures that every job is executed within a Docker container containing both Maven 3.6.3 and Java 11, providing a consistent runtime environment.
variables: This section defines environment variables accessible to all jobs. These variables are critical for configuring how Maven behaves within the CI environment.

Maven Environment Variables and Optimization

The use of variables in the .gitlab-ci.yml file is not merely for convenience but for technical necessity regarding network protocols and file system paths.

MAVEN_OPTS: This variable contains environment settings required whenever Maven is executed.
-Dhttps.protocols=TLSv1.2: This specific flag sets the TLS protocol to version 1.2 for all HTTP requests. This is crucial for security compliance and ensuring that the runner can successfully communicate with remote Maven repositories that require modern TLS versions.
-Dmaven.repo.local=$CIPROJECTDIR/.m2/repository: This is a critical configuration that redirects the local Maven repository to the GitLab project directory. By default, Maven stores dependencies in the user's home directory, which is not persisted across different jobs or containers. Moving the repository to the project directory allows GitLab's caching mechanism to track and preserve the dependencies.
MAVENCLIOPTS: These are arguments appended to the mvn command to modify the build lifecycle.
-DskipTests: This flag is used to skip the test stage during the package and install goals. This is an optimization strategy to avoid running the full test suite multiple times across different stages of the pipeline.

The Job Execution Lifecycle

The pipeline is divided into three specific jobs: build-JAR, test-code, and install-JAR. Each job is assigned to a specific stage, ensuring a logical progression of the software build process.

build-JAR: Assigned to the build stage. It executes the command mvn $MAVEN_CLI_OPTS package. This job compiles the code and packages it into a JAR file while skipping tests to maximize speed.
test-code: Assigned to the test stage. It executes mvn test. This is the dedicated phase where the code is validated against the test suite.
install-JAR: Assigned to the install stage. It executes mvn $MAVEN_CLI_OPTS install. This job installs the package into the local repository, making it available for other projects.

The script keyword in GitLab functions similarly to the steps block in a Jenkinsfile. While the provided example shows jobs with a single command, GitLab allows for multiple sequential commands within a single script block.

Infrastructure Requirements and Executor Strategies

The execution of a Maven pipeline depends heavily on the type of GitLab Runner and the executor being utilized.

The Shell Executor Approach

For environments that mimic the traditional Jenkins agent behavior, a GitLab Runner with a Shell executor is used. This requires that the physical or virtual machine hosting the runner has the following prerequisites installed:
- Maven 3.6.3
- Java 11 JDK

In this scenario, the commands are executed directly on the host's shell. This is the closest equivalent to Jenkins' "Freestyle with shell execution" or "Freestyle with Maven task plugin" methods.

The Docker Executor Approach

A more modern and flexible approach involves using the Docker executor. Instead of relying on a persistent machine, GitLab spins up an ephemeral container for each job.

Image usage: The image: maven:3.6.3-openjdk-11 directive tells the runner to pull a specific Docker image.
Benefits: This removes the need to manually maintain Java and Maven versions on the runner's host machine. It increases flexibility, as different projects can use different versions of Java or Maven simply by changing the image tag in the YAML file.
Public Runners: For those using GitLab.com, public instance runners can be utilized, removing the need to host and maintain your own infrastructure.

Caching Strategies for Maven Dependencies

One of the most significant performance bottlenecks in Maven pipelines is the repeated downloading of dependencies from remote repositories (such as Maven Central). GitLab solves this using the cache keyword.

The provided configuration uses the following cache settings:

yaml cache: key: $CI_COMMIT_REF_SLUG paths: - .m2/

key: The $CI_COMMIT_REF_SLUG ensures that the cache is specific to the branch or tag being built. This prevents conflicts between different feature branches that may rely on different versions of the same dependency.
paths: By caching the .m2/ directory, GitLab saves the downloaded plugins and dependencies. Because the MAVEN_OPTS variable was used to move the local repository into the project directory, the runner can successfully zip and upload the .m2/ folder to the GitLab cache server and restore it for subsequent jobs. This drastically reduces the execution time of the test-code and install-JAR jobs.

Comparative Technical Summary: Jenkins vs. GitLab CI/CD

The following table provides a technical comparison of the components used in the migration from a Jenkins-based Maven build to a GitLab-based one.

Feature	Jenkins (Declarative)	GitLab CI/CD	Impact of Change
Configuration File	`Jenkinsfile`	`.gitlab-ci.yml`	Version-controlled pipeline as code
Environment	Agent (VM/Physical)	Docker Image	Ephemeral, reproducible environments
Tool Management	`tools` block	`image` keyword	No manual tool installation on agents
Execution Unit	`stage` -> `steps`	`stage` -> `job` -> `script`	More granular control over job dependencies
Dependency Cache	Local workspace/Persistent	`cache` keyword	Faster builds via distributed caching
Variable Scope	Environment/Global	`variables` keyword	Unified variable management across jobs

Final Analysis of Pipeline Efficiency

The migration from Jenkins to GitLab CI/CD for Maven projects represents more than a syntax change; it is an optimization of the entire build lifecycle. By utilizing the Docker executor, the pipeline achieves total isolation, ensuring that the build is not contaminated by leftover artifacts from previous runs.

The use of the MAVEN_OPTS and MAVEN_CLI_OPTS variables allows for a clean separation between the Maven environment configuration (TLS versions and repository paths) and the actual build goals (packaging and installing). The strategic use of -DskipTests during the build and install phases, combined with a dedicated test-code job, ensures that the pipeline is both fast and rigorous.

Furthermore, the integration of the .m2/ directory into the GitLab cache via the $CI_COMMIT_REF_SLUG key creates a highly efficient loop where dependencies are only downloaded once per branch. This architecture reduces network overhead and minimizes the time spent in the "dependency resolution" phase of the Maven lifecycle.

Sources

GitLab CI/CD Migration Examples - Jenkins Maven