One-click install

Managed Dify hosting

The RAG-first LLM application platform — deployed, managed, and kept up to date by HostAgentes. Build knowledge-base chatbots, RAG pipelines, and multi-LLM workflows without managing Postgres, Redis, or a vector store.

Unlimited apps. Unlimited messages. Built-in vector store. From $9.99/mo.

What is managed Dify hosting?

Dify is an open-source LLM application platform purpose-built for retrieval-augmented generation (RAG), visual agent workflows, and production LLM app management. It ships a prompt IDE, a built-in vector store, a RAG pipeline builder, dataset versioning, and conversation-level observability — all in one self-hosted package. Managed Dify hosting means a third-party provider deploys and maintains a private Dify instance on your behalf, handling infrastructure so you can focus on building knowledge bases and LLM applications instead of managing Postgres, Redis, object storage, and vector indices yourself.

HostAgentes offers managed Dify hosting in three tiers: Basic at $9.99/mo (1 vCPU, 2 GB RAM, built-in vector store), Pro at $29.99/mo (2 vCPU, team management, production observability), and Scale at $79.99/mo (4 vCPU, multiple workspaces, target 99.9% uptime SLA). Unlike Dify Cloud — which charges per message — HostAgentes charges for dedicated compute with no usage caps on apps or conversations.

Every HostAgentes Dify instance comes with pre-configured Weaviate or Qdrant for vector storage, PostgreSQL for app state, Redis for caching, and S3-compatible object storage for uploaded documents. SSL is provisioned automatically. Deployment takes under five minutes with no Docker Compose or CLI commands required.

Simple, transparent pricing

No message limits. No per-seat charges. Pay for compute, not usage.

Basic

$9.99 /mo

For experimenting with Dify and building your first RAG-powered apps.

  • 1 vCPU, 2 GB RAM
  • 20 GB storage (vector + object)
  • Unlimited LLM apps
  • Unlimited messages
  • Bring your own model keys (BYOK)
  • Built-in vector store
  • SSL and automatic backups
  • Community support
Get started
Most popular

Pro

$29.99 /mo

For teams building production RAG pipelines and multi-LLM workflows.

  • Everything in Basic
  • 2 vCPU, 4 GB RAM
  • 50 GB storage (vector + object)
  • Team management + roles
  • Advanced RAG pipeline builder
  • Production observability dashboard
  • Email support
  • Automatic zero-downtime updates
Get started

Scale

$79.99 /mo

For organizations with multiple high-traffic LLM applications.

  • Everything in Pro
  • 4 vCPU, 8 GB RAM
  • 100 GB storage (vector + object)
  • Multiple workspaces
  • Enterprise SSO
  • Priority support
  • Target 99.9% uptime SLA
Get started

7-day money-back guarantee. Cancel at any time.

The price you see is the price you pay. No renewal surprises, no hidden fees.

The full Dify stack, managed

Every feature Dify ships — vector store, RAG pipelines, multi-LLM routing, observability — live on your instance from day one.

database

Built-in vector store

Weaviate and Qdrant available out of the box. Upload PDFs, Word docs, Notion exports, or web URLs and Dify handles embedding, indexing, and retrieval automatically — no external vector DB setup required.

code

Visual prompt IDE

Design, test, and version prompts without touching a terminal. Compare outputs across GPT-4o, Claude 4, and Gemini side-by-side. Promote a prompt version to production with a single click.

account_tree

RAG pipeline builder

Chain document retrieval, re-ranking, and LLM calls in a visual canvas. Configure chunk size, overlap, and hybrid search weighting per knowledge base — no Python required.

hub

Multi-LLM routing

Switch between OpenAI, Anthropic Claude, Google Gemini, Mistral, and any OpenAI-compatible endpoint per app. No vendor lock-in. Your keys, your costs, no markup.

monitoring

Production observability

Track latency, token spend, and retrieval quality per conversation. Identify hallucination-prone queries, slow retrievals, and cost outliers from the same dashboard you build in.

storage

Dataset management

Version your knowledge bases independently from your apps. Roll back to a previous document set after a bad import, or A/B test two corpora against the same prompt.

Popular Dify apps on HostAgentes

What teams actually build once they have a managed Dify instance with a live vector store and no message caps.

1

Product documentation chatbot

Index a 500-page technical manual into a knowledge base. Dify retrieves the right section and generates a cited, grounded answer — no hallucinated API endpoints.

2

Internal wiki Q&A

Connect a SharePoint or Confluence export. Employees ask questions in plain English; Dify finds the right policy page and summarizes it with a direct link.

3

Customer support copilot

Combine a live CRM lookup tool with a knowledge base of resolved tickets. The agent checks order status in real time, then grounds its reply in your support playbook.

4

Code review assistant

Feed open GitHub PR diffs as context. The agent reviews for security anti-patterns, style violations, and missing tests — scoped to your team's standards document.

5

Research summary agent

Ingest arXiv abstracts or uploaded papers on a schedule. Ask a natural-language question; the agent retrieves the most relevant passages and synthesizes a structured summary.

6

Legal contract Q&A

Vector-index a library of precedents and clause templates. Lawyers query specific indemnification language or governing-law clauses across hundreds of contracts in seconds.

A Dify app in 30 lines of DSL

Dify apps are portable YAML definitions. Export from one instance, import to another, or version-control them alongside your knowledge base configurations. The snippet below is a production-ready RAG agent that retrieves from a versioned knowledge base, applies hybrid search, and falls back to web search for unknown topics.

docs-qa-agent.yaml
# Dify DSL — RAG agent with document retrieval
app:
  name: "Docs QA Agent"
  mode: agent-chat
  description: "Answers questions from the product knowledge base"

model:
  provider: anthropic
  name: claude-sonnet-4-5
  temperature: 0.2

knowledge_bases:
  - id: product-docs-v3
    retrieval_mode: hybrid          # semantic + keyword
    top_k: 6
    score_threshold: 0.55

tools:
  - type: knowledge_retrieval
    knowledge_base_id: product-docs-v3
  - type: web_search               # fallback for unknown topics

system_prompt: |
  You are a precise product support assistant.
  Always cite the source section when answering.
  If the knowledge base does not contain a relevant answer, say so.

Import this DSL via Dify's Studio → Import app. Swap product-docs-v3 for your own knowledge base ID.

From zero to RAG in 3 steps

1

Pick a plan and deploy

Choose Basic, Pro, or Scale. HostAgentes provisions Postgres, Redis, object storage, and Weaviate automatically. Your Dify instance is live with a TLS endpoint in under 5 minutes.

2

Add your LLM keys and upload documents

Paste your OpenAI, Anthropic, or Gemini API key in Dify's settings. Then create a knowledge base: upload PDFs, paste URLs, or connect a Notion integration. Dify chunks, embeds, and indexes them automatically.

3

Build, publish, and monitor

Use the prompt IDE to craft your system prompt, configure retrieval parameters, and publish your app as a chatbot widget or REST API. Watch latency, token spend, and retrieval quality in the observability dashboard.

Why HostAgentes for Dify?

Dify's self-hosted stack is complex. HostAgentes tames it so you ship knowledge-base apps instead of managing containers.

Vector store included — no extra bill

Self-hosting Dify means provisioning and maintaining a separate Weaviate or Qdrant cluster. On HostAgentes, the vector store runs alongside your Dify instance on the same plan. No extra cloud service, no separate SLA to track.

No per-message pricing trap

Dify Cloud charges by message volume: plans cap at a fixed message count and charge overages. HostAgentes charges for dedicated vCPU and RAM. Your RAG app can answer thousands of questions a day — the bill stays the same.

All dependencies managed

A production Dify install requires Postgres, Redis, Celery workers, object storage, and a vector DB — all with compatible versions. HostAgentes provisions, updates, and monitors every dependency. You never open a Docker Compose file.

Pair it with Paperclip for agents

Dify excels at RAG applications. For multi-step autonomous agents with persistent memory and tool-calling orchestration, Paperclip is the right complement. Run both from the same HostAgentes account and let each tool do what it does best.

Frequently asked questions

What makes Dify different from LangChain?
LangChain is a Python framework: you write code to chain prompts, tools, and retrievers. Dify is a full application platform with a visual IDE, built-in vector store, dataset management, and a production observability layer — no code required for most use cases. Developers who want LangChain-style control can still call Dify's REST API from any backend. They solve different problems: LangChain is a building block; Dify is a complete LLM app platform.
Can I use my own vector database with Dify?
Yes. Dify supports Weaviate, Qdrant, Pinecone, Milvus, and pgvector as external vector store backends. On HostAgentes, Weaviate and Qdrant are pre-configured and available immediately. If your organization already runs a Pinecone or Milvus cluster, you can point Dify at it from the dataset settings page — no infrastructure changes on your HostAgentes instance required.
How do BYOK LLM keys work on Dify?
Dify is BYOK (Bring Your Own Keys) by design. You add your OpenAI, Anthropic, Google, Mistral, or other provider API keys in the Dify settings panel. Requests go directly from your HostAgentes instance to the model provider — HostAgentes never proxies or marks up LLM calls. You see your exact token costs in your provider's dashboard and in Dify's observability view.
Can I run Dify and Paperclip on the same HostAgentes account?
Yes. Dify and Paperclip serve complementary use cases. Dify is best for RAG-heavy LLM applications, chatbots, and knowledge bases. Paperclip is HostAgentes' multi-agent orchestration product built for long-running autonomous agents with persistent memory. Many teams run both: Dify for customer-facing Q&A apps and Paperclip for backend agent workflows. Each is billed as a separate plan.
Is vector data encrypted at rest?
Yes. All storage on HostAgentes — including the vector index and uploaded documents — is encrypted at rest using AES-256. Connections to your Dify instance use TLS 1.2 or higher. On Scale plan instances, encryption keys are rotated on a schedule. HostAgentes staff cannot read the contents of your knowledge bases.

Deploy your Dify instance

Vector store, RAG pipelines, multi-LLM routing, and production observability — live in under 5 minutes. Unlimited apps and messages from $9.99/mo.