Skip to content

Crawl4AI (JS-capable web extraction)

1. Overview

crawl4ai is an Atlas service family in the media category. Its implementation and service-owned documentation live under services/crawl4ai/.

2. Role In Atlas

Atlas uses this service according to its manifest, topology row, SOURCE settings, dependencies, and runtime data-flow declarations.

3. Tracks And Category

  • Category: media
  • Kind: container
  • Tracks: all, gen-ai-rag

4. Access

  • Kong aliases: crawl4ai.localhost
  • Port variables: CRAWL4AI_PORT

5. Configuration

  • SOURCE variables: CRAWL4AI_SOURCE
  • Default SOURCE values: disabled
  • Available SOURCE values: container, disabled

6. Dependencies And Topology

  • Required dependencies: -
  • Optional dependencies: local-deep-researcher, n8n, backend, weaviate
  • Runtime calls: -

7. Source Values

SOURCE Variable Default Values
CRAWL4AI_SOURCE disabled container, disabled

8. Runtime Integration

The manifest data-flow list declares runtime calls to -. The topology row supplies aliases and port surfaces used by the generated gateway and service references.

9. Architecture

10. Operations

Use ./start.sh to configure this service through the wizard or pass the matching SOURCE flag when the service is source-configurable. Use ./stop.sh to stop the active Atlas project.

11. Source Documentation