YAML Pipeline Deployment¶

Hayhooks supports deploying Haystack pipelines directly from YAML definitions. This approach builds request/response schemas automatically from the YAML-declared inputs and outputs.

Overview¶

YAML pipeline deployment is ideal for:

Simple pipelines with clear inputs and outputs
Quick deployment without wrapper code
Automatic schema generation
CI/CD pipeline deployments

Converting Existing Pipelines

If you already have a Haystack Pipeline instance, you can serialize it with pipeline.dumps() and then manually add the required inputs and outputs sections before deploying.

Requirements¶

YAML Structure¶

Your YAML file must include both inputs and outputs sections:

components:
  converter:
    type: haystack.components.converters.html.HTMLToDocument
    init_parameters:
      extraction_kwargs: null

  fetcher:
    init_parameters:
      raise_on_failure: true
      retry_attempts: 2
      timeout: 3
      user_agents:
        - haystack/LinkContentFetcher/2.0.0b8
    type: haystack.components.fetchers.link_content.LinkContentFetcher

  llm:
    init_parameters:
      api_key:
        env_vars:
          - OPENAI_API_KEY
        strict: true
        type: env_var
      generation_kwargs: {}
      model: gpt-4o-mini
    type: haystack.components.generators.chat.openai.OpenAIChatGenerator

  prompt:
    init_parameters:
      template: |
        {% message role="user" %}
        According to the contents of this website:
        {% for document in documents %}
          {{document.content}}
        {% endfor %}
        Answer the given question: {{query}}
        {% endmessage %}
      required_variables: "*"
    type: haystack.components.builders.chat_prompt_builder.ChatPromptBuilder

connections:
  - receiver: converter.sources
    sender: fetcher.streams
  - receiver: prompt.documents
    sender: converter.documents
  - receiver: llm.messages
    sender: prompt.prompt

inputs:
  urls: fetcher.urls
  query: prompt.query

outputs:
  replies: llm.replies

Key Requirements¶

inputs Section: Maps friendly names to pipeline component fields
outputs Section: Maps pipeline outputs to response fields
Valid Components: All components must be properly defined
Valid Connections: All connections must reference existing components

Deployment Methods¶

CLIHTTP APIPython

# Deploy with default settings
hayhooks pipeline deploy-yaml pipelines/chat_pipeline.yml

# Deploy with custom name
hayhooks pipeline deploy-yaml -n my_chat_pipeline pipelines/chat_pipeline.yml

# Deploy with description
hayhooks pipeline deploy-yaml -n my_chat_pipeline --description "Chat pipeline for Q&A" pipelines/chat_pipeline.yml

# Overwrite existing pipeline
hayhooks pipeline deploy-yaml -n my_chat_pipeline --overwrite pipelines/chat_pipeline.yml

curl -X POST \
  http://localhost:1416/deploy-yaml \
  -H 'Content-Type: application/json' \
  -d '{
    "name": "my_chat_pipeline",
    "description": "Chat pipeline for Q&A",
    "source_code": "...",
    "overwrite": false
  }'

import requests

response = requests.post(
    "http://localhost:1416/deploy-yaml",
    json={
        "name": "my_chat_pipeline",
        "description": "Chat pipeline for Q&A",
        "source_code": "...",  # Your YAML content as string
        "overwrite": False
    }
)
print(response.json())

CLI Options¶

Option	Short	Description	Default
`--name`	`-n`	Override the pipeline name	YAML file stem
`--description`		Human-readable description	Pipeline name
`--overwrite`	`-o`	Overwrite if pipeline exists	`false`
`--skip-mcp`		Skip MCP tool registration	`false`
`--save-file`		Save YAML to server	`true`
`--no-save-file`		Don't save YAML to server	`false`

Input/Output Mapping¶

Input Mapping¶

The inputs section maps friendly names to pipeline component fields:

inputs:
  # friendly_name: component.field
  urls: fetcher.urls
  query: prompt_builder.query

Mapping rules:

Use component.field syntax
Field must exist in the component
Multiple inputs can map to the same component field
Input names become API parameters
Use a YAML list when the same external field should feed multiple component inputs

inputs:
  query:
    - chat_summary_prompt_builder.query
    - answer_builder.query

How multi-target inputs are resolved

Hayhooks normalizes list-declared inputs by looking at the first valid target to derive type metadata and marks the input as required regardless of component-level metadata. At runtime the resolved value is fanned out to all listed component inputs, so the example above sends the same payload to both chat_summary_prompt_builder.query and answer_builder.query even if the external parameter is named differently (for example a_query). This ensures downstream components get the expected inputs while the API still exposes a single friendly field.

Output Mapping¶

The outputs section maps pipeline outputs to response fields:

outputs:
  # response_field: component.field
  replies: llm.replies
  documents: fetcher.documents

Mapping rules:

Use component.field syntax
Field must exist in the component
Response fields are serialized to JSON
Complex objects are automatically serialized

Automatic include_outputs_from Derivation

Hayhooks automatically derives the include_outputs_from parameter from your outputs section. This ensures that all components referenced in the outputs are included in the pipeline results, even if they're not leaf components.

Example: If your outputs reference retriever.documents and llm.replies, Hayhooks automatically sets include_outputs_from={"retriever", "llm"} when running the pipeline.

What this means: You don't need to configure anything extra - just declare your outputs in the YAML, and Hayhooks ensures those component outputs are available in the results!

Comparison with PipelineWrapper

YAML Pipelines (this page): include_outputs_from is automatic - derived from your outputs section

PipelineWrapper: include_outputs_from must be manually passed:

For streaming: Pass to streaming_generator() / async_streaming_generator()
For non-streaming: Pass to pipeline.run() / pipeline.run_async()

See PipelineWrapper: include_outputs_from for examples.

API Usage¶

After Deployment¶

Once deployed, your pipeline is available at:

Run Endpoint: /{pipeline_name}/run
Schema: /{pipeline_name}/schema
OpenAPI: /openapi.json

Example Request¶

curl -X POST \
  http://localhost:1416/my_chat_pipeline/run \
  -H 'Content-Type: application/json' \
  -d '{
    "urls": ["https://haystack.deepset.ai"],
    "query": "What is Haystack?"
  }'

Example Response¶

{
  "replies": ["Haystack is an open source framework..."],
  "documents": [...],
  "metadata": {...}
}

Schema Generation¶

Hayhooks automatically generates:

Request Schema¶

{
  "type": "object",
  "properties": {
    "urls": {
      "type": "array",
      "items": {"type": "string"}
    },
    "query": {"type": "string"}
  },
  "required": ["urls", "query"]
}

Response Schema¶

{
  "type": "object",
  "properties": {
    "replies": {"type": "array"},
    "documents": {"type": "array"},
    "metadata": {"type": "object"}
  }
}

Obtaining YAML from Existing Pipelines¶

You can obtain YAML from existing Haystack pipelines:

from haystack import Pipeline

# Create or load your pipeline
pipeline = Pipeline()
# ... add components and connections ...

# Get YAML representation
yaml_content = pipeline.dumps()

# Save to file
with open("pipeline.yml", "w") as f:
    f.write(yaml_content)

Note

You'll need to manually add the inputs and outputs sections to the generated YAML.

Limitations¶

YAML Pipeline Limitations

YAML-deployed pipelines have the following limitations:

No OpenAI Compatibility: Don't support OpenAI-compatible chat endpoints
No Streaming: Streaming responses are not supported
No File Uploads: File upload handling is not available
Async Only: Pipelines are run as AsyncPipeline instances

Using PipelineWrapper for Advanced Features

For advanced features, use PipelineWrapper instead:

# For OpenAI compatibility
class PipelineWrapper(BasePipelineWrapper):
    def run_chat_completion(self, model: str, messages: list[dict], body: dict) -> Union[str, Generator]:
        ...

# For file uploads
class PipelineWrapper(BasePipelineWrapper):
    def run_api(self, files: Optional[list[UploadFile]] = None, query: str = "") -> str:
        ...

# For streaming
class PipelineWrapper(BasePipelineWrapper):
    def run_chat_completion_async(self, model: str, messages: list[dict], body: dict) -> AsyncGenerator:
        ...

Example¶

For a complete working example of a YAML pipeline with proper inputs and outputs, see:

tests/test_files/yaml/inputs_outputs_pipeline.yml

This example demonstrates:

Complete pipeline structure with components and connections
Proper inputs mapping to component fields
Proper outputs mapping from component results
Real-world usage with LinkContentFetcher, HTMLToDocument, ChatPromptBuilder, and OpenAIChatGenerator

Next Steps¶

PipelineWrapper - For advanced features like streaming and chat completion
Examples - See working examples