ctias-lab

CTIAS Lab Production Operations Guide

Quick Start for Production

System Requirements

Minimum:

4 CPU cores
8 GB RAM
50 GB storage
Docker 20.10+
Docker Compose 2.0+

Recommended:

8 CPU cores
16 GB RAM
100 GB SSD storage
Kubernetes cluster (for high availability)

Installation

Clone and Configure

git clone https://github.com/pangerlkr/ctias-lab.git
cd ctias-lab

# Copy and edit environment file
cp .env.example .env
nano .env  # Update with production values

Update Critical Settings

At minimum, change these in .env:

JWT_SECRET - Generate with: openssl rand -hex 32
POSTGRES_PASSWORD - Use a strong password
ADMIN_PASSWORD - Use a strong password
CORS_ORIGINS - Set to your actual domain

Deploy

# Start all services
docker-compose up -d

# Check status
docker-compose ps

# View logs
docker-compose logs -f

Verify Deployment

# Check API health
curl http://localhost:8000/health

# Check frontend
curl http://localhost:3000

Service Endpoints

Service	URL	Description
Frontend	http://localhost:3000	Web UI
Gateway API	http://localhost:8000	REST API
API Docs	http://localhost:8000/docs	OpenAPI/Swagger
PostgreSQL	localhost:5432	Database
Redis	localhost:6379	Cache

Production Checklist

Before Going Live

Security Hardening

Network Security
- Use HTTPS/TLS everywhere
- Restrict database access to internal network
- Use VPN for administrative access
- Configure firewall rules
Application Security
- Enable rate limiting
- Implement authentication/authorization
- Validate all inputs
- Keep dependencies updated
- Regular security scans
Data Security
- Encrypt data at rest
- Encrypt data in transit
- Regular backups with encryption
- Secure key management

Daily Operations

Monitoring

Health Checks:

# Check all services
docker-compose ps

# API health
curl http://localhost:8000/health

# Check logs for errors
docker-compose logs --tail=100 gateway | grep ERROR

Resource Usage:

# Container stats
docker stats

# Disk usage
df -h

# Database size
docker exec ctias-postgres psql -U ctias -d ctias_lab -c \
  "SELECT pg_size_pretty(pg_database_size('ctias_lab'));"

Backups

Manual Backup:

# Database
docker exec ctias-postgres pg_dump -U ctias ctias_lab > \
  backup-$(date +%Y%m%d-%H%M%S).sql

# Compress
gzip backup-*.sql

Automated Backup Script:

#!/bin/bash
# backup.sh - Add to cron for daily backups

BACKUP_DIR="/backups"
DATE=$(date +%Y%m%d-%H%M%S)

# Database backup
docker exec ctias-postgres pg_dump -U ctias ctias_lab | \
  gzip > $BACKUP_DIR/ctias-db-$DATE.sql.gz

# Keep only last 7 days
find $BACKUP_DIR -name "ctias-db-*.sql.gz" -mtime +7 -delete

echo "Backup completed: ctias-db-$DATE.sql.gz"

Add to crontab:

# Run daily at 2 AM
0 2 * * * /path/to/backup.sh

Updates

Updating the Application:

# Pull latest code
git pull origin main

# Rebuild and restart
docker-compose down
docker-compose build --no-cache
docker-compose up -d

# Verify
docker-compose ps
curl http://localhost:8000/health

Database Migrations:

# If using Alembic
docker exec ctias-gateway alembic upgrade head

Log Management

View Logs:

# All services
docker-compose logs -f

# Specific service
docker-compose logs -f gateway

# Last N lines
docker-compose logs --tail=100 gateway

# Search for errors
docker-compose logs gateway | grep -i error

Log Rotation: Configure in /etc/docker/daemon.json:

{
  "log-driver": "json-file",
  "log-opts": {
    "max-size": "10m",
    "max-file": "3"
  }
}

Troubleshooting

Common Issues

1. Service Won’t Start

# Check logs
docker-compose logs <service-name>

# Verify environment
docker-compose config

# Recreate service
docker-compose up -d --force-recreate <service-name>

2. Database Connection Errors

# Check database is running
docker-compose ps postgres

# Test connection
docker exec -it ctias-postgres psql -U ctias -d ctias_lab

# Check credentials in .env
grep DATABASE_URL .env

3. Out of Memory

# Check memory usage
docker stats

# Restart services
docker-compose restart

# Increase memory limits in docker-compose.yml

4. High CPU Usage

# Identify culprit
docker stats

# Check for runaway queries
docker exec ctias-postgres psql -U ctias -d ctias_lab -c \
  "SELECT pid, query FROM pg_stat_activity WHERE state = 'active';"

5. Disk Space Full

# Check disk usage
df -h

# Clean Docker resources
docker system prune -a --volumes

# Remove old logs
docker-compose logs > /dev/null 2>&1

Performance Issues

Slow API Responses:

Check database query performance
Verify Redis is responding
Check CPU/memory usage
Review application logs
Enable query logging

Database Performance:

# Vacuum and analyze
docker exec ctias-postgres psql -U ctias -d ctias_lab -c \
  "VACUUM ANALYZE;"

# Check slow queries
docker exec ctias-postgres psql -U ctias -d ctias_lab -c \
  "SELECT query, calls, total_time, mean_time FROM pg_stat_statements ORDER BY mean_time DESC LIMIT 10;"

Scaling

Horizontal Scaling

Scale API Gateway:

docker-compose up -d --scale gateway=3

Load Balancing: Use nginx or Traefik as reverse proxy:

upstream gateway {
    server localhost:8000;
    server localhost:8001;
    server localhost:8002;
}

server {
    listen 80;
    server_name api.yourdomain.com;

    location / {
        proxy_pass http://gateway;
    }
}

Vertical Scaling

Edit docker-compose.yml:

gateway:
  deploy:
    resources:
      limits:
        cpus: '2'
        memory: 2G
      reservations:
        cpus: '1'
        memory: 1G

Disaster Recovery

Restore from Backup

Database:

# Stop services
docker-compose stop gateway

# Restore database
gunzip < backup-20240101.sql.gz | \
  docker exec -i ctias-postgres psql -U ctias -d ctias_lab

# Restart services
docker-compose start gateway

Full System Recovery

Install Docker and Docker Compose
Clone repository
Copy backed up .env file
Restore database from backup
Start services
Verify functionality

Monitoring Setup

Prometheus + Grafana

Add to docker-compose.yml:

  prometheus:
    image: prom/prometheus
    volumes:
      - ./prometheus.yml:/etc/prometheus/prometheus.yml
    ports:
      - "9090:9090"

  grafana:
    image: grafana/grafana
    ports:
      - "3001:3000"
    environment:
      - GF_SECURITY_ADMIN_PASSWORD=admin

Alerting

Configure alerts for:

API response time > 2s
Error rate > 5%
Database connections > 80% max
Disk usage > 80%
Memory usage > 90%

Maintenance Windows

Planned Maintenance

Announce maintenance window
Set API to read-only mode
Backup database
Perform updates
Test functionality
Resume normal operations
Monitor for issues

Emergency Maintenance

Take backup if possible
Perform necessary fixes
Restore from backup if needed
Verify functionality
Document incident

Support

Documentation:

Contact:

Email: contact@pangerlkr.link
GitHub Issues: https://github.com/pangerlkr/ctias-lab/issues

Version History

v1.0.0 - Initial production release
- Gateway API with health checks
- IOC analysis endpoints
- Reconnaissance capabilities
- Database models
- Docker deployment
- Security hardening

This site is open source. Improve this page.