Building a Complete Grafana LGTM Observability Platform with Docker Compose
What is LGTM?
LGTM stands for Loki, Grafana, Tempo, and Metrics (usually Prometheus). It’s an open-source toolkit for monitoring your applications and infrastructure.
Think of it as a control center where you can:
- See all your logs in one place (Loki)
- View beautiful dashboards of your system’s performance (Grafana)
- Track requests as they move through your system (Tempo)
- Monitor key metrics like CPU, memory usage, and more (Prometheus)
Why Should You Care?
- Find Problems Faster: When something breaks, quickly see what happened
- Understand Your System: Get insights into how your application performs
- Plan Better: Use real data to make decisions about scaling or optimizing
- Learn by Doing: Perfect project to understand modern monitoring techniques
What You’ll Build
This guide will show you how to create a complete monitoring system using Docker Compose. When finished, you’ll have:
How it works: Your applications send data to the OpenTelemetry Collector, which forwards it to specialized storage systems. Grafana brings everything together in one dashboard.
Ready-to-Use Sample Repository
For convenience, you can directly use or reference a complete sample repository containing all the configuration files and setup described in this guide:
- GitHub Repository: samzhu/grafana-otel-lgtm-stack
This repository includes all the configuration files for:
- OpenTelemetry Collector
- Prometheus
- Loki
- Tempo
- Grafana (with pre-configured datasources and dashboards)
You can either clone this repository to get started immediately or follow the step-by-step guide below to understand each component.
1 | # Clone the repository |
Prerequisites
- Basic knowledge of command line and Docker
- Docker and Docker Compose installed on your computer
- A terminal to run commands
Installing Docker (for Absolute Beginners)
If you don’t have Docker installed yet:
- Windows/Mac: Download and install Docker Desktop
- Linux: Follow the installation instructions for your distribution
Docker Compose is included with Docker Desktop. For Linux, you might need to install it separately.
Quick Start Guide
Step 1: Setting Up Your Environment
Create a new directory for your project:
1 | mkdir grafana-lgtm |
Create the following files with the configurations shown below.
Step 2: Starting the System
1 | # Start all services |
Step 3: Accessing Your New Monitoring System
Open your browser and visit:
- Grafana Dashboard: http://localhost:3000
- Username:
admin
- Password:
admin
- Username:
Congratulations! You now have a professional-grade monitoring system running on your computer.
What’s Included in Your Monitoring System
Main Components
Grafana (http://localhost:3000)
- The dashboard where you view everything
- Pre-configured to show logs, metrics, and traces
Prometheus (http://localhost:9090)
- Stores performance metrics
- Keeps track of numbers like CPU usage, memory consumption, etc.
Loki (http://localhost:3100)
- Stores logs from your applications
- Lets you search through logs efficiently
Tempo (http://localhost:3200)
- Stores traces of requests through your system
- Helps you understand how requests flow through your applications
OpenTelemetry Collector (http://localhost:4318)
- Central receiver for all monitoring data
- Routes each type of data to the right storage system
Understanding OTLP (OpenTelemetry Protocol)
Throughout this guide, you’ll see “OTLP” mentioned. This is the OpenTelemetry Protocol - a standardized way for applications to send monitoring data. Think of it as a common language that your applications speak to tell the monitoring system what’s happening.
Configuration Files Explained
Let’s look at the main configuration files you’ll need to create:
Docker Compose File (docker-compose.yml
)
This file tells Docker how to set up all the services:
1 | services: |
OpenTelemetry Collector Configuration (otel-collector-config.yaml
)
The collector receives data from your applications and sends it to the right place:
1 | receivers: |
Prometheus Configuration (prometheus.yml
)
Prometheus collects metrics from the OpenTelemetry Collector:
1 | global: |
Loki Configuration (loki-local-config.yaml
)
Loki stores and indexes your logs:
1 | auth_enabled: false |
Tempo Configuration (tempo-local.yaml
)
Tempo stores trace data that shows how requests move through your system:
1 | server: |
Grafana Configuration
Data Sources (grafana/datasources.yml
)
This tells Grafana where to find your data:
1 | apiVersion: 1 |
Dashboard Provider (grafana/dashboards.yml
)
This tells Grafana where to find dashboard definitions:
1 | apiVersion: 1 |
Sample Dashboard (grafana/dashboards/springboot-observability.json
)
Create this directory and file to include a sample dashboard:
1 | mkdir -p grafana/dashboards |
1 | { |
Connecting Your Applications
To get the most value from your monitoring system, you need to connect your applications to it. Here’s how to do it for some common languages:
For Java Applications
Add the OpenTelemetry Java Agent when starting your application:
1 | # Download the agent (if you haven't already) |
For Node.js Applications
Install OpenTelemetry packages:
1 | npm install @opentelemetry/sdk-node @opentelemetry/exporter-trace-otlp-http @opentelemetry/exporter-metrics-otlp-http @opentelemetry/exporter-logs-otlp-http |
Add this code to your application:
1 | const { NodeSDK } = require('@opentelemetry/sdk-node'); |
Viewing Your Data
- Open Grafana at http://localhost:3000
- Login with username
admin
and passwordadmin
- Click “Explore” in the left sidebar to query your data
- Select a data source (Prometheus, Loki, or Tempo) and start exploring
Basic Query Examples
Here are some simple queries to get you started:
Prometheus (for metrics)
1 | # Show all metrics for a specific service |
Loki (for logs)
1 | # Show all logs for a specific service |
Tempo (for traces)
Usually, you’ll click on a trace ID in a log or metric to see a trace. But you can also search:
- By service name
- By duration (to find slow requests)
- By trace ID (if you know it)
Common Tasks and Commands
Restarting or Resetting Everything
1 | # Restart all services |
Checking for Problems
If something isn’t working right:
1 | # View logs from all services |
Glossary for Beginners
- Logs: Text messages your applications produce, like “User logged in” or “Error occurred”
- Metrics: Numbers that measure performance, like CPU usage or request count
- Traces: Records of a request as it travels through your system
- Dashboard: A screen showing charts and data visualizations
- OpenTelemetry: A standard way to collect and send monitoring data
- Docker: A tool for running applications in containers
- Docker Compose: A tool for running multiple containers together
Next Steps
Once you’re comfortable with the basics:
- Create custom dashboards for your applications
- Set up alerts to notify you when problems occur
- Add more applications to your monitoring system
- Learn PromQL (Prometheus Query Language) for advanced metrics analysis
- Explore more OpenTelemetry features
Resources for Learning More
- Grafana Documentation
- OpenTelemetry Documentation
- Prometheus Documentation
- Loki Documentation
- Tempo Documentation
Troubleshooting Common Issues
Services Won’t Start
If some services fail to start:
- Make sure all your configuration files are correctly named and placed
- Check for port conflicts - another application might be using the same ports
- Check the logs with
docker compose logs [service-name]
Can’t See Data in Grafana
If Grafana is running but you don’t see data:
- Verify all services are running with
docker compose ps
- Check if data sources are correctly configured in Grafana
- Make sure your application is correctly sending data to the collector
OpenTelemetry Issues
If your application connects but data doesn’t appear:
- Confirm you’re using the correct endpoint URL (http://localhost:4318)
- Check for firewall or network issues blocking connections
- Verify your application is properly configured with the correct service name