LangChain v1 integration¶

This page shows the complete integration of PIIGhost into a LangGraph agent, based on the example available in examples/graph/.

Installation¶

To use the LangChain middleware, install the additional dependencies:

uvpip

uv add piighost langchain langgraph langchain-openai

pip install piighost langchain langgraph langchain-openai

Optional dependency

PIIAnonymizationMiddleware imports langchain when instantiated. If langchain is not installed, an explicit ImportError is raised: "You must install piighost[langchain] for use middleware".

Integration structure¶

GLiNER2 model
    └── GlinerDetector
            └── Anonymizer
                    └── AnonymizationPipeline
                                └── PIIAnonymizationMiddleware
                                            └── create_agent(middleware=[...])

Full example¶

from dotenv import load_dotenv
from gliner2 import GLiNER2
from langchain.agents import create_agent
from langchain_core.tools import tool

from piighost.anonymizer import Anonymizer, GlinerDetector
from piighost.middleware import PIIAnonymizationMiddleware
from piighost.pipeline import AnonymizationPipeline

load_dotenv()

# ---------------------------------------------------------------------------
# 1. Define the agent tools
# ---------------------------------------------------------------------------

@tool
def send_email(to: str, subject: str, body: str) -> str:
    """Send an email to the given address.

    Args:
        to: Recipient email address.
        subject: Email subject line.
        body: Email body text.

    Returns:
        Confirmation string.
    """
    return f"Email successfully sent to {to}."


@tool
def get_weather(country_or_city: str) -> str:
    """Get the current weather for a given location.

    Args:
        country_or_city: Name of the city or country.

    Returns:
        A weather summary string.
    """
    return f"The weather in {country_or_city} is 22°C and sunny."


# ---------------------------------------------------------------------------
# 2. Configure the system prompt for placeholders
# ---------------------------------------------------------------------------

system_prompt = """\
You are a helpful assistant. Some inputs may contain anonymized placeholders \
that replace real values for privacy reasons.

Rules:
1. Treat every placeholder as if it were the real value. Never comment on its \
format, never say it is a token, never ask the user to reveal it.
2. Placeholders can be passed directly to tools — use them as-is as input arguments. \
This preserves the user's privacy while still allowing tools to operate.
3. If the user asks for a specific detail about a placeholder \
(e.g. "what is the first letter?"), reply briefly: "I cannot answer that question \
as the data has been anonymized to protect your personal information."
"""

# ---------------------------------------------------------------------------
# 3. Initialize the anonymization stack
# ---------------------------------------------------------------------------

# Load the GLiNER2 model (HuggingFace download ~500 MB on first run)
extractor = GLiNER2.from_pretrained("fastino/gliner2-multi-v1")

detector = GlinerDetector(model=extractor, threshold=0.5, flat_ner=True)
anonymizer = Anonymizer(detector=detector)
pipeline = AnonymizationPipeline(
    anonymizer=anonymizer,
    labels=["PERSON", "LOCATION"],
)
middleware = PIIAnonymizationMiddleware(pipeline=pipeline)

# ---------------------------------------------------------------------------
# 4. Create the LangGraph agent with the middleware
# ---------------------------------------------------------------------------

graph = create_agent(
    model="openai:gpt-4o-mini",
    system_prompt=system_prompt,
    tools=[send_email, get_weather],
    middleware=[middleware],
)

How the middleware works¶

PIIAnonymizationMiddleware intercepts each agent turn at three points:

`abefore_model` — before the LLM¶

User       : "Send an email to Patrick in Paris"
      ↓
Middleware : NER detection on HumanMessage
           → "Send an email to <<PERSON_1>> in <<LOCATION_1>>"
      ↓
LLM sees   : "Send an email to <<PERSON_1>> in <<LOCATION_1>>"

`awrap_tool_call` — around tools¶

LLM calls    : send_email(to="<<PERSON_1>>", subject="...", body="...")
      ↓
Middleware   : deanonymize args
             → send_email(to="Patrick", subject="...", body="...")
      ↓
Tool receives: to="Patrick"  ← real value
      ↓
Tool returns : "Email successfully sent to Patrick."
      ↓
Middleware   : reanonymize response
             → "Email successfully sent to <<PERSON_1>>."
      ↓
LLM sees     : "Email successfully sent to <<PERSON_1>>."

`aafter_model` — after the LLM¶

LLM replies  : "Done! Email sent to <<PERSON_1>>."
      ↓
Middleware   : deanonymize all messages
             → "Done! Email sent to Patrick."
      ↓
User sees    : "Done! Email sent to Patrick."

Using the agent¶

import asyncio

async def main():
    response = await graph.ainvoke({
        "messages": [{"role": "user", "content": "Send an email to Patrick in Paris"}]
    })
    print(response["messages"][-1].content)
    # Done! Email sent to Patrick.

asyncio.run(main())

With Langfuse (observability)¶

The full example includes Langfuse integration to trace LLM calls:

from langfuse import get_client
from langfuse.langchain import CallbackHandler

langfuse = get_client()
langfuse_handler = CallbackHandler()

graph = create_agent(
    model="openai:gpt-4o-mini",
    system_prompt=system_prompt,
    tools=[send_email, get_weather],
    middleware=[middleware],
    callbacks=[langfuse_handler],  # (1)!
)

Langfuse callbacks are added to create_agent. All LLM interactions are traced — with anonymized text (the tracing layer never sees personal data).

Deployment with Aegra¶

The examples/graph/ example is designed to be deployed with Aegra (a self-hosted LangSmith alternative).

aegra.json file:

{
  "graph": "./src/graph/graph.py:graph",
  "http": "./src/graph/app.py:app",
  "ttl": {
    "interval_minutes": 60,
    "default_minutes": 20160
  }
}

# Start the development server (graph + FastAPI on port 8000)
uv run aegra dev

# Full stack with PostgreSQL
docker compose up --build

Environment variables¶

Copy .env.example to .env and fill in:

# LLM
OPENAI_API_KEY=sk-...
# or
ANTHROPIC_API_KEY=sk-ant-...

# Aegra (required)
AEGRA_CONFIG=aegra.json

# Database
DATABASE_URL=postgresql://user:pass@localhost:5432/piighost

# Observability (optional)
LANGFUSE_PUBLIC_KEY=...
LANGFUSE_SECRET_KEY=...