Add Distributed Tracing with Tempo
**Time to Complete:** 25 minutes

**Goal:** Add OpenTelemetry instrumentation to your service and view distributed traces in Grafana Tempo.
What You'll Learn
By the end of this tutorial, you will have:
- ✅ Instrumented your application with OpenTelemetry
- ✅ Configured trace export to Grafana Tempo
- ✅ Generated traces by making requests to your service
- ✅ Viewed and analyzed traces in the Grafana UI
Prerequisites
Before you begin, ensure you have:
- [ ] Completed Tutorial 1: Deploy Your First Service
- [ ] Your `hello-fawkes` service running and accessible
- [ ] Access to Grafana (typically at https://grafana.127.0.0.1.nip.io)
- [ ] Basic understanding of distributed tracing concepts (helpful but not required)
What is Distributed Tracing?
Distributed tracing tracks requests as they flow through multiple services. Each request gets a unique trace ID, and each service operation creates a "span". This helps you debug performance issues and understand system behavior. Learn more about Unified Telemetry.
Step 1: Install OpenTelemetry Dependencies
We'll add OpenTelemetry instrumentation to the Node.js application we created in Tutorial 1.
- Navigate to your `hello-fawkes` directory:

```bash
cd hello-fawkes
```

- Install OpenTelemetry packages:

```bash
npm install --save \
  @opentelemetry/api \
  @opentelemetry/sdk-node \
  @opentelemetry/auto-instrumentations-node \
  @opentelemetry/exporter-trace-otlp-http
```

- Commit the updated `package.json` and `package-lock.json`:

```bash
git add package.json package-lock.json
git commit -m "Add OpenTelemetry dependencies"
```
Checkpoint
OpenTelemetry dependencies are installed and ready to use.
Step 2: Create OpenTelemetry Configuration
- Create a new file `tracing.js` in your project root:
```javascript
const { NodeSDK } = require("@opentelemetry/sdk-node");
const { getNodeAutoInstrumentations } = require("@opentelemetry/auto-instrumentations-node");
const { OTLPTraceExporter } = require("@opentelemetry/exporter-trace-otlp-http");
const { Resource } = require("@opentelemetry/resources");
const { SemanticResourceAttributes } = require("@opentelemetry/semantic-conventions");

// Configure the trace exporter
const traceExporter = new OTLPTraceExporter({
  url: process.env.OTEL_EXPORTER_OTLP_ENDPOINT || "http://tempo.fawkes-platform.svc.cluster.local:4318/v1/traces",
});

// Create resource with service information
const resource = new Resource({
  [SemanticResourceAttributes.SERVICE_NAME]: process.env.OTEL_SERVICE_NAME || "hello-fawkes",
  [SemanticResourceAttributes.SERVICE_VERSION]: process.env.SERVICE_VERSION || "1.0.0",
  [SemanticResourceAttributes.DEPLOYMENT_ENVIRONMENT]: process.env.ENVIRONMENT || "development",
});

// Initialize the SDK
const sdk = new NodeSDK({
  resource: resource,
  traceExporter: traceExporter,
  instrumentations: [
    getNodeAutoInstrumentations({
      // Disable file system instrumentation for cleaner traces
      "@opentelemetry/instrumentation-fs": {
        enabled: false,
      },
    }),
  ],
});

// Start the SDK
sdk.start();

// Graceful shutdown
process.on("SIGTERM", () => {
  sdk
    .shutdown()
    .then(() => console.log("Tracing terminated"))
    .catch((error) => console.log("Error terminating tracing", error))
    .finally(() => process.exit(0));
});

console.log("OpenTelemetry tracing initialized");
```
- Update `server.js` to load tracing first:
```javascript
// Load tracing before anything else
require("./tracing");

const express = require("express");
const app = express();
const PORT = process.env.PORT || 8080;

app.get("/", (req, res) => {
  res.json({
    message: "Hello from Fawkes!",
    timestamp: new Date().toISOString(),
    version: "1.0.0",
    tracing: "enabled",
  });
});

app.get("/health", (req, res) => {
  res.json({ status: "healthy" });
});

// Add a new endpoint to simulate a traced operation
app.get("/api/data", async (req, res) => {
  // Simulate some work
  await new Promise((resolve) => setTimeout(resolve, 100));
  res.json({
    data: [
      { id: 1, name: "Item 1" },
      { id: 2, name: "Item 2" },
      { id: 3, name: "Item 3" },
    ],
    traceId: req.headers["x-trace-id"] || "auto-generated",
  });
});

app.listen(PORT, "0.0.0.0", () => {
  console.log(`Server running on port ${PORT}`);
});
```
- Commit the changes:

```bash
git add tracing.js server.js
git commit -m "Add OpenTelemetry instrumentation"
```
Checkpoint
Your application is now instrumented with OpenTelemetry!
Step 3: Update Kubernetes Deployment
We need to configure environment variables for the OpenTelemetry exporter.
- Update `k8s/deployment.yaml` to add the environment variables:
```yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: hello-fawkes
  namespace: my-first-app
  labels:
    app: hello-fawkes
    version: v2
spec:
  replicas: 2
  selector:
    matchLabels:
      app: hello-fawkes
  template:
    metadata:
      labels:
        app: hello-fawkes
        version: v2
    spec:
      containers:
        - name: hello-fawkes
          image: YOUR-USERNAME/hello-fawkes:v2.0.0 # Update version
          ports:
            - containerPort: 8080
              name: http
          env:
            - name: PORT
              value: "8080"
            - name: OTEL_SERVICE_NAME
              value: "hello-fawkes"
            - name: SERVICE_VERSION
              value: "2.0.0"
            - name: ENVIRONMENT
              value: "development"
            - name: OTEL_EXPORTER_OTLP_ENDPOINT
              value: "http://tempo.fawkes-platform.svc.cluster.local:4318/v1/traces"
          livenessProbe:
            httpGet:
              path: /health
              port: 8080
            initialDelaySeconds: 10
            periodSeconds: 10
          readinessProbe:
            httpGet:
              path: /health
              port: 8080
            initialDelaySeconds: 5
            periodSeconds: 5
          resources:
            requests:
              memory: "128Mi" # Increased for tracing overhead
              cpu: "100m"
            limits:
              memory: "256Mi" # Increased for tracing overhead
              cpu: "200m"
          securityContext:
            runAsNonRoot: true
            runAsUser: 1000
            allowPrivilegeEscalation: false
            readOnlyRootFilesystem: true
```
- Commit the updated manifest:

```bash
git add k8s/deployment.yaml
git commit -m "Configure OpenTelemetry environment variables"
```
!!! info "Why These Environment Variables?"

    - `OTEL_SERVICE_NAME`: Identifies your service in traces
    - `OTEL_EXPORTER_OTLP_ENDPOINT`: Where to send traces (the Tempo endpoint)
    - `ENVIRONMENT`: Lets you filter traces by environment (dev/staging/prod)
Checkpoint
Deployment is configured to export traces to Tempo.
Step 4: Build and Deploy Updated Application
- Build the new version of your container:

```bash
docker build -t YOUR-USERNAME/hello-fawkes:v2.0.0 .
```

- Push the image:

```bash
docker push YOUR-USERNAME/hello-fawkes:v2.0.0
```

- Push your code changes to Git:

```bash
git push
```

- If using ArgoCD, it will sync automatically. If not, apply manually:

```bash
kubectl apply -f k8s/deployment.yaml
```

- Watch the rollout:

```bash
kubectl rollout status deployment/hello-fawkes -n my-first-app
```

- Verify the new pods are running:

```bash
kubectl get pods -n my-first-app
```
Checkpoint
Your updated application with tracing is deployed and running!
Step 5: Generate Traces
Now let's create some traces by making requests to our service.
- Make some requests to generate traces:

```bash
# Make multiple requests
for i in {1..10}; do
  curl https://hello-fawkes.127.0.0.1.nip.io/
  sleep 1
done
```

- Make requests to the new `/api/data` endpoint:

```bash
# Generate traces with the data endpoint
for i in {1..10}; do
  curl https://hello-fawkes.127.0.0.1.nip.io/api/data
  sleep 1
done
```

- Mix in some health check requests:

```bash
curl https://hello-fawkes.127.0.0.1.nip.io/health
```
Generate Realistic Traffic
The more varied your requests, the more interesting your traces will be. Try different endpoints and patterns.
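If you prefer scripting the traffic from Node.js instead of a shell loop, a small generator like this works too. The default `baseUrl` matches this tutorial's ingress host and is an assumption; point it at wherever your service is actually reachable.

```javascript
// Simple traffic generator: rotates through endpoints to produce varied traces.
const sleep = (ms) => new Promise((resolve) => setTimeout(resolve, ms));

async function generateTraffic(baseUrl = "https://hello-fawkes.127.0.0.1.nip.io", requests = 10) {
  const endpoints = ["/", "/api/data", "/health"];
  const results = [];
  for (let i = 0; i < requests; i++) {
    const path = endpoints[i % endpoints.length]; // vary the endpoint per request
    try {
      const res = await fetch(baseUrl + path); // global fetch (Node 18+)
      results.push({ path, status: res.status });
    } catch (err) {
      results.push({ path, error: err.message });
    }
    await sleep(200); // spread requests out a little
  }
  return results;
}
```

Run it with, for example, `generateTraffic(undefined, 30).then(console.log)` to produce a batch of mixed requests.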
Checkpoint
You've generated trace data that should now be visible in Grafana Tempo!
Step 6: View Traces in Grafana
Now for the exciting part - seeing your traces visualized!
- Open Grafana in your browser: https://grafana.127.0.0.1.nip.io
- Log in with your Grafana credentials (ask your platform team if you don't have them).
- Navigate to **Explore** (compass icon in the left sidebar).
- Select **Tempo** as the data source from the dropdown at the top.
- In the query builder:
    - Select the **Search** tab
    - Service Name: `hello-fawkes`
    - Click **Run query**
- You should see a list of traces! Click on one to expand it.
- In the trace view, you'll see:
    - **Timeline**: Visual representation of span durations
    - **Span details**: Operation names, durations, tags
    - **Service map**: Shows service dependencies (even for a single service)
Understanding the Trace View
Each horizontal bar is a "span" representing an operation. Nested spans show parent-child relationships. Longer bars indicate slower operations.
Checkpoint
You're viewing distributed traces in Grafana Tempo! 🎉
Step 7: Analyze a Trace
Let's understand what you're seeing in the trace view.
- Pick a trace for the `/api/data` endpoint.
- Expand the spans to see the hierarchy:

```text
GET /api/data (root span)
├─ Express middleware
├─ Route handler
└─ HTTP response
```

- Look at the span details:
    - **Duration**: How long this operation took
    - **Tags**: Metadata like HTTP method, status code, URL
    - **Logs**: Any events recorded during this span
- Compare traces:
    - Click on multiple traces to see timing variations
    - Look for patterns in slow requests
What Makes a Good Trace?
A well-instrumented trace shows you exactly where time is spent. You should be able to answer: "Which operation is slow?" without looking at code.
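Everything the trace view shows is derived from flat span records. As a sketch, here is how "self time" (a span's duration minus the time spent in its children) can be computed from such records to answer exactly that question; the span data below is fabricated for illustration.

```javascript
// Flat span records, as an exporter would receive them (fabricated values)
const spans = [
  { spanId: "a", parentSpanId: null, name: "GET /api/data", durationMs: 120 },
  { spanId: "b", parentSpanId: "a", name: "middleware", durationMs: 5 },
  { spanId: "c", parentSpanId: "a", name: "route handler", durationMs: 105 },
];

// Self time = duration minus the total duration of direct children.
// A span with high self time is where the request actually spends its time.
function selfTimes(spans) {
  const childTotal = new Map();
  for (const s of spans) {
    if (s.parentSpanId !== null) {
      childTotal.set(s.parentSpanId, (childTotal.get(s.parentSpanId) || 0) + s.durationMs);
    }
  }
  return spans.map((s) => ({
    name: s.name,
    selfMs: s.durationMs - (childTotal.get(s.spanId) || 0),
  }));
}

// The slowest operation by self time answers "where is the time going?"
const slowest = selfTimes(spans).sort((x, y) => y.selfMs - x.selfMs)[0];
```

Here the root span is 120 ms long, but only 10 ms of that is its own work; the route handler's 105 ms is the real hot spot, just as the timeline view would show.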
Step 8: Create a Grafana Dashboard (Optional)
For ongoing monitoring, create a dashboard to visualize trace metrics.
- In Grafana, go to **Dashboards** → **New** → **New Dashboard**.
- Click **Add visualization**.
- Select **Tempo** as the data source.
- Create a panel showing request rate:
    - Query: use TraceQL: `{ resource.service.name = "hello-fawkes" }`
    - Visualization: **Time series**
- Add another panel for duration percentiles:
    - This shows p50, p95, p99 latencies over time
- Save the dashboard as "Hello Fawkes - Tracing".
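For intuition about what a duration-percentile panel computes, here is a minimal nearest-rank percentile calculation over a batch of span durations; the sample values are made up for illustration.

```javascript
// Nearest-rank percentile: sort the values, pick the one at ceil(p * n) - 1.
function percentile(values, p) {
  const sorted = [...values].sort((a, b) => a - b);
  const rank = Math.ceil(p * sorted.length) - 1;
  return sorted[Math.max(0, rank)];
}

// Fabricated request durations in ms: mostly ~100ms with two slow outliers
const durations = [98, 101, 99, 103, 100, 250, 102, 97, 104, 400];

const p50 = percentile(durations, 0.5); // typical request
const p95 = percentile(durations, 0.95); // tail latency
const p99 = percentile(durations, 0.99); // worst-case tail
```

Note how p50 stays near 100 ms while p95/p99 jump to the outliers: percentiles surface tail latency that averages hide, which is exactly why dashboards track them.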
Checkpoint
You now have a dashboard to monitor your service's trace data continuously!
What You've Accomplished
Congratulations! You've successfully:
- ✅ Instrumented a Node.js application with OpenTelemetry
- ✅ Configured trace export to Grafana Tempo
- ✅ Deployed the instrumented application to Fawkes
- ✅ Generated and viewed distributed traces
- ✅ Analyzed trace data to understand application behavior
What's Next?
Continue your Fawkes journey:
- Consume Vault Secrets - Secure your application configuration
- Measure DORA Metrics - See how tracing contributes to observability metrics
- How to Trace Requests with Tempo - Advanced tracing techniques
Troubleshooting
No Traces Appearing in Grafana
- Check that Tempo is running:

```bash
kubectl get pods -n fawkes-platform -l app=tempo
```

- Verify your application can reach Tempo:

```bash
kubectl exec -n my-first-app deployment/hello-fawkes -- \
  wget -O- http://tempo.fawkes-platform.svc.cluster.local:4318/v1/traces
```

- Check application logs for tracing errors:

```bash
kubectl logs -n my-first-app -l app=hello-fawkes | grep -i otel
```
Traces Appear but Are Incomplete
- Ensure `tracing.js` is loaded before any other modules in `server.js`
- Check that auto-instrumentation is enabled for Express
- Verify resource limits aren't too restrictive
High Memory Usage After Adding Tracing
- Tracing adds ~20-30MB overhead per container
- Adjust resource limits if needed
- Consider sampling: only trace a percentage of requests in production
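A deterministic sketch of how trace-ID ratio sampling works (this mirrors the idea behind OpenTelemetry's `TraceIdRatioBasedSampler`, though the real implementation differs): interpret a prefix of the trace ID as a number and keep the trace only if it falls below the ratio threshold. Because the decision is a pure function of the trace ID, every service in a request path makes the same keep-or-drop choice.

```javascript
// Decide whether to sample a trace from its hex ID and a ratio in [0, 1].
// Every service computes the same answer for the same trace ID, so a trace
// is either kept end-to-end or dropped end-to-end (illustrative sketch only).
function shouldSample(traceId, ratio) {
  const prefix = parseInt(traceId.slice(0, 8), 16); // first 32 bits as a number
  return prefix < ratio * 0x100000000;
}
```

In practice you would configure sampling through the SDK or the standard `OTEL_TRACES_SAMPLER` / `OTEL_TRACES_SAMPLER_ARG` environment variables rather than writing your own sampler.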
Learn More
- Unified Telemetry Explanation - How metrics, logs, and traces work together
- How to Trace Requests with Tempo - Advanced tracing patterns
- OpenTelemetry Documentation - Official OpenTelemetry docs
Feedback
How was this tutorial? Did you successfully view your traces? Share your experience in the Fawkes Community Mattermost!