
Configuration Reference

This page documents all available configuration options for Bizon pipelines.

A complete pipeline configuration file has the following structure. Values separated by "|" list the accepted alternatives:

name: my-pipeline
source:
  type: <source_type>
  config:
    # Source-specific configuration
destination:
  type: <destination_type>
  config:
    # Destination-specific configuration
queue:
  type: python | rabbitmq | kafka
  config:
    # Queue-specific configuration
backend:
  type: sqlite | postgres | bigquery
  config:
    # Backend-specific configuration
runner:
  mode: thread | process | stream
  workers: 4 # Number of parallel workers
metrics:
  enabled: true
  provider: datadog | opentelemetry
  config:
    # Provider-specific configuration
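
The enumerated values in the skeleton above can be checked before a pipeline is started. The following is a minimal sketch, not Bizon's own validation: the function name `check_pipeline_config` is hypothetical, the configuration is assumed to be already parsed into a Python dict, and treating `name`, `source`, and `destination` as required while the other sections are optional is an assumption made for illustration.

```python
# Accepted values are copied from the placeholders in the skeleton above.
ALLOWED = {
    ("queue", "type"): {"python", "rabbitmq", "kafka"},
    ("backend", "type"): {"sqlite", "postgres", "bigquery"},
    ("runner", "mode"): {"thread", "process", "stream"},
    ("metrics", "provider"): {"datadog", "opentelemetry"},
}


def check_pipeline_config(config: dict) -> list[str]:
    """Return a list of problems; an empty list means none were found."""
    problems = []

    # Assumption: a pipeline needs a name, a source, and a destination.
    if not config.get("name"):
        problems.append("missing 'name'")
    for section in ("source", "destination"):
        block = config.get(section)
        if not isinstance(block, dict) or "type" not in block:
            problems.append(f"'{section}' must be a mapping with a 'type'")

    # Assumption: the remaining sections are optional, but if present
    # their enumerated field must hold one of the listed values.
    for (section, key), allowed in ALLOWED.items():
        block = config.get(section)
        if block is None:
            continue
        if not isinstance(block, dict):
            problems.append(f"'{section}' must be a mapping")
            continue
        value = block.get(key)
        if value not in allowed:
            problems.append(
                f"{section}.{key} is {value!r}, expected one of {sorted(allowed)}"
            )
    return problems
```

Calling `check_pipeline_config` on a dict with `runner: {mode: fork}` would report one problem for `runner.mode`, while a dict that uses only the listed values returns an empty list.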

Source options:

Option   Type     Description
type     string   Source type identifier
config   object   Source-specific configuration

BigQuery destination:

destination:
  type: bigquery
  config:
    project_id: your-gcp-project
    dataset: your_dataset
    table: your_table
    credentials_path: /path/to/credentials.json # Optional

Python queue:

queue:
  type: python
  config:
    max_size: 10000 # Maximum queue size

RabbitMQ queue:

queue:
  type: rabbitmq
  config:
    host: localhost
    port: 5672
    username: guest
    password: guest
    virtual_host: /
    queue_name: bizon

Kafka queue:

queue:
  type: kafka
  config:
    bootstrap_servers: localhost:9092
    topic: bizon-pipeline
    consumer_group: bizon-group

SQLite backend:

backend:
  type: sqlite
  config:
    path: ":memory:" # Or a path to a database file

PostgreSQL backend:

backend:
  type: postgres
  config:
    connection_string: postgresql://user:pass@host:5432/db
    checkpoint_interval: 1000

BigQuery backend:

backend:
  type: bigquery
  config:
    project_id: your-gcp-project
    dataset: bizon_state
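
The PostgreSQL `connection_string` above is a URL, so a username or password containing characters such as `@` or `:` generally has to be percent-encoded before it is placed in the string. The sketch below shows one way to build such a value with the Python standard library; the helper name `build_postgres_url` is hypothetical, and how Bizon itself parses the string is not described on this page.

```python
from urllib.parse import quote, unquote, urlsplit


def build_postgres_url(user, password, host, port, database):
    """Assemble a postgresql:// URL, percent-encoding the variable parts."""
    # safe="" makes quote() also encode "/", "@" and ":" inside each part.
    return (
        f"postgresql://{quote(user, safe='')}:{quote(password, safe='')}"
        f"@{host}:{port}/{quote(database, safe='')}"
    )


url = build_postgres_url("user", "p@ss:word", "host", 5432, "db")

# Splitting the URL again recovers the individual components.
parts = urlsplit(url)
original_password = unquote(parts.password)
```

Here `url` can be used as the `connection_string` value, and `original_password` round-trips back to the unencoded password.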

Runner:

runner:
  mode: thread # thread, process, or stream
  workers: 4 # Number of parallel workers
  batch_size: 1000 # Records per batch
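
To make the meaning of `workers` and `batch_size` concrete, the following sketch splits a stream of records into batches and hands them to a pool of threads. It only illustrates the two settings for `mode: thread`; it is not Bizon's runner, and `batched`, `process_batch`, and `run` are hypothetical names introduced here.

```python
from concurrent.futures import ThreadPoolExecutor
from itertools import islice


def batched(records, batch_size):
    """Yield consecutive lists of at most batch_size records."""
    iterator = iter(records)
    while True:
        batch = list(islice(iterator, batch_size))
        if not batch:
            return
        yield batch


def process_batch(batch):
    # Hypothetical stand-in for real work such as writing to a destination.
    return len(batch)


def run(records, workers=4, batch_size=1000):
    """Process batches on `workers` threads; results keep batch order."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(process_batch, batched(records, batch_size)))
```

With the values from the configuration above, 2,500 records would be split into two batches of 1,000 and one of 500, processed by up to four threads.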

Datadog metrics:

metrics:
  enabled: true
  provider: datadog
  config:
    api_key: your-api-key
    app_key: your-app-key

OpenTelemetry metrics:

metrics:
  enabled: true
  provider: opentelemetry
  config:
    endpoint: http://localhost:4317
    service_name: bizon-pipeline

Configuration values can reference environment variables with the ${VAR_NAME} syntax:

destination:
  type: bigquery
  config:
    project_id: ${GCP_PROJECT_ID}
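
The substitution of `${...}` references can be pictured as a recursive walk over the parsed configuration. This is a minimal sketch under stated assumptions, not Bizon's implementation: `expand_env` is a hypothetical helper, the configuration is assumed to be already loaded into Python dicts and lists, and raising an error for an unset variable is a choice made here because the behaviour for missing variables is not documented on this page.

```python
import os
import re

# Matches ${NAME} where NAME is a typical environment variable identifier.
_ENV_REF = re.compile(r"\$\{([A-Za-z_][A-Za-z0-9_]*)\}")


def expand_env(value, environ=None):
    """Return a copy of value with ${NAME} references replaced."""
    environ = os.environ if environ is None else environ

    if isinstance(value, dict):
        return {key: expand_env(item, environ) for key, item in value.items()}
    if isinstance(value, list):
        return [expand_env(item, environ) for item in value]
    if isinstance(value, str):
        def replace(match):
            name = match.group(1)
            # Assumption: an unset variable is an error, not an empty string.
            if name not in environ:
                raise KeyError(f"environment variable {name} is not set")
            return environ[name]

        return _ENV_REF.sub(replace, value)
    # Numbers, booleans, and None pass through unchanged.
    return value
```

Passing an explicit `environ` mapping, as in `expand_env(config, {"GCP_PROJECT_ID": "demo"})`, keeps the function easy to test without touching the real process environment.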