Product · Private RAG for Corporate Knowledge· v2 · Production

DECKLOG.
Your knowledge. Your perimeter. Your answer.

Retrieval-augmented AI grounded in your SharePoint, Drive, Confluence, Notion and ticketing systems — with source-aware permissions, citation-enforced answers, and EU-sovereign deployment that public LLMs cannot match.

See tiers + pricing Talk to engineering

EU Data Boundary verifiedSource-system ACL enforced at query timeGDPR / DORA / NIS2 evidence-ready

Documents leaving your tenant — RAG runs in your perimeter

100%

Answers grounded with verifiable source citations

<3 sec

Median answer latency on production-tier deployments

Quarterly

Evaluation refresh catches regressions before users do

The bleed

Your knowledge is the asset. Your search is the failure mode.

Every modern company has the same problem at a different scale. Documents proliferate; the right answer is somewhere; nobody can find it; the public LLMs that could find it cannot be trusted with the corpus. The result is a quiet, compounding productivity tax.

Knowledge scattered across six systems nobody searches

SharePoint, Confluence, Notion, Drive, Slack, the policy PDFs nobody opens. New hires take six weeks to find the on-boarding doc. Senior staff get interrupted constantly with questions the handbook already answers in section 4.

Public LLMs leak your data and hallucinate your facts

ChatGPT, Claude.ai, Gemini consumer surfaces — convenient until Legal finds your contract clauses in a free-tier transcript. The "we banned ChatGPT" memo solves the leak but leaves the productivity hole nobody filled.

Bolt-on AI that retrieves the wrong document with confidence

Vendor RAG demos look brilliant on a 100-document corpus. Production at 10,000 documents, with real permissions and real updates, breaks the demo magic. The agent answers from a deleted policy. Trust collapses inside a month.

Architecture · Every Layer

What ships inside DECKLOG.

Five concrete layers from ingest to deployment. Built on the production-RAG playbook we deployed at four EU clients over 18 months — with the failure modes we learned about written directly into the architecture.

Layer 01

Source ingestion + ACL inheritance

Microsoft 365: SharePoint Online, OneDrive, Teams, Loop, Exchange — ACL inherited from source at ingest
Google Workspace: Drive, Docs, Sheets, Slides, Sites with Drive sharing-level enforcement
Confluence Cloud + Data Center + Notion + Slite + Slab with workspace-permission mapping
ServiceNow knowledge base + Zendesk Help Center + GitHub Wiki + GitLab Wiki
Slack + Teams chat history (ACL-aware, selective inclusion per channel policy)
Custom: PDF corpora, internal CMS exports, S3-resident document archives

Layer 02

Retrieval pipeline

Hybrid retrieval (BM25 + dense vector) — catches both semantic similarity + exact-identifier lookups
Reranking with Cohere Rerank 3 (managed) or bge-reranker-v2-m3 (self-hosted) — 95% quality parity
Per-query ACL re-validation: group memberships expanded at query time, never trust stored permissions alone
Sensitivity-label honouring (Microsoft Purview integration where licensed)
Time-aware retrieval: newer documents weighted; deprecated documents excluded
Multi-hop retrieval for questions that span multiple documents (Sovereign tier)

Layer 03

Generation + grounding

Generation: Claude Sonnet 4.6 (Bedrock EU) or GPT-4o (Azure OpenAI EU) or self-hosted Llama 3.1 70B FP8
Citation enforcement: every claim links to a retrieved chunk ID; fabricated citations fail validation server-side
Confidence-thresholded refusal: below threshold the agent refuses rather than fabricates
Prompt caching on the system prompt + retrieved context: 70-90% cache hit on multi-turn conversations
Provider abstraction: switching from Anthropic to OpenAI to self-hosted is a config change, not a rewrite
Per-query cost + latency telemetry tied back to the source documents that drove the answer

Layer 04

Evaluation harness

Golden question set built collaboratively with subject-matter experts in the first 4 weeks
Automated retrieval evals (Recall@K, nDCG@K) run on every commit to retrieval code
Automated answer evals (LLM-judged faithfulness, groundedness, completeness) on the golden set
User-feedback loop (thumbs + free-text) sampled into review queue for engineering attention
Regression detection: a model upgrade that drops eval score above tolerance cannot be merged
Quarterly eval refresh: new golden questions, retired stale ones, scored against current performance

Layer 05

Deployment + observability

Deployment surfaces: Microsoft Teams app, Slack bot, web UI hosted in your tenant, browser extension, API access
Langfuse self-hosted in your tenant for trace storage + cost analysis + eval datasets
Vector store options: pgvector (Postgres-native), Qdrant (managed or self-hosted), Pinecone (managed)
Webhook-driven invalidation: SharePoint / Drive / Confluence changes propagate within minutes
Nightly reconciliation: walks source systems vs vector store; flags drift above threshold
GDPR Article 17 right-to-erasure honoured with documented deletion SLA

Engagement Tiers · Pick your depth

Start with one team. Operate across the corpus. Scale to sovereign.

Three productised tiers. Same retrieval pipeline, same evaluation harness. The difference is breadth of source systems + retention + compliance grade.

Tier

Starter

from €1,990 / monthPer tenant + storage tier

Best for

Single-team pilot (HR, Legal, Ops, or Finance) over a focused corpus of <10k documents from one source system.

One source-system connector (SharePoint OR Drive OR Confluence OR Notion)
Up to 10,000 documents indexed
Hybrid retrieval + reranking
Deployment to Teams OR Slack OR web UI
Golden question set with 50 SME-validated questions
30-day post-deploy support window

Outcome

Single team finds answers in seconds; "where is X policy" Slack questions to senior staff drop 60-70%.

Request Starter

Operate

from €4,990 / monthPer tenant + storage tier

Best for

Multi-team operations across HR + Legal + Ops + Finance, multiple source systems, with monthly content growth.

Up to 4 source-system connectors
Up to 50,000 documents indexed
Sensitivity-label honouring (Microsoft Purview integration)
Multi-surface deployment (Teams + Slack + web UI)
Monthly quality dashboard with user-feedback analysis
Quarterly evaluation refresh with new golden questions
Named AI engineer + same-day triage on quality issues

Outcome

Cross-functional knowledge layer; new-hire ramp shortens 3-4x; senior staff stop being interruption-driven.

Request Operate

Tier

Sovereign

from €9,990 / monthPer tenant + storage tier

Best for

Regulated entities (DORA / NIS2 financial services, healthcare, legal practices, public sector) requiring audit-grade RAG with EU sovereignty and 7-year retention.

Everything in Operate
EU Data Boundary verification across model + storage tiers
Self-hosted open-weights option (Llama 3.1 70B FP8 or Qwen 2.5 72B on your infra)
Per-query audit log with regulatory retention (7 years configurable)
Multi-hop retrieval for cross-document regulatory analysis
Custom Copilot Studio / Claude Project for specific business processes
Tabletop exercises for AI-incident response
Dedicated AI Architect + monthly executive review

Outcome

Audit-grade RAG with full evidence chain; defensible posture under DORA + NIS2 supervisory review.

Request Sovereign

Looking for a bespoke implementation owned outright instead of consumed as SaaS? DECKLOG Implementation · from €4,990

Native Connectors

Every source system DECKLOG already speaks.

Microsoft 365, Google Workspace, Atlassian, Notion, ServiceNow, GitHub — the corpus surfaces where modern enterprises actually store knowledge. ACL-aware, webhook-driven, reconciled nightly.

Microsoft SharePoint Online

Native Graph API + sharing-permission inheritance + sensitivity-label honouring

Microsoft OneDrive + Teams + Loop

Personal + shared + collaborative-canvas content fully covered

Google Workspace Drive + Docs

Drive sharing model + Workspace permission groups respected at query time

Confluence Cloud + Data Center

Atlassian workspace permissions mapped, page-level ACL preserved

Notion

Teamspace + page-level permission inheritance with API-driven sync

ServiceNow KB + Zendesk Help Center

Knowledge-base articles with view-permission enforcement

Slack + Microsoft Teams (chat)

Selective channel ingestion with per-channel policy

GitHub + GitLab Wiki

Engineering-doc surfaces with org-membership ACL

Custom PDF + CMS exports

Bulk-ingest pipeline with manual ACL mapping for legacy archives

Azure OpenAI EU + AWS Bedrock EU

Default generation providers in EU regions with Customer Lockbox

Self-hosted Llama / Qwen / Mistral

Sovereign-tier option via vLLM runtime on your bare-metal

Langfuse (self-hosted)

Trace + eval + cost observability inside your perimeter

Where DECKLOG runs

Your tenant. Your region. Your choice of generation provider.

The model + the vector store + the trace history all run inside infrastructure you control. Three deployment patterns covering every EU regulatory profile from "managed-cloud is fine" to "no third-party data plane, period".

Managed EU (default)

Azure OpenAI EU (Sweden Central) or AWS Bedrock EU (Frankfurt) for generation. Vector store inside your Azure / AWS account. Customer Lockbox enabled where supported. Suitable for the majority of regulated EU SMB workloads.

Hybrid sovereign

Generation on EU-region managed providers; vector store + audit log on your bare-metal infrastructure (Hetzner / OVH / IONOS / on-prem). Reduces cloud surface area while retaining managed-model quality.

Full sovereign (self-hosted)

Self-hosted Llama 3.1 70B FP8 or Qwen 2.5 72B (multilingual) on your bare-metal. No third-party data plane. The model weights stay yours. Suitable for the most regulated tenants where US-jurisdiction providers are the threat model itself.

EU Regulatory Alignment

DORA. NIS2. GDPR. ISO 27001. Evidence-ready by design.

The audit trail produces evidence packs as a side effect of operation. The Sovereign tier adds auditor-ready formatting + 7-year retention + DORA Article 28 third-party ICT-risk evidence with sub-processor disclosure.

DORA Article 28

Third-party ICT-risk evidence: model provider, hosting region, sub-processors, exit strategy, concentration analysis.

NIS2 Article 21

Technical baseline: PII redaction at boundary, MFA-bound query surface, audit logging, incident-response integration.

GDPR Articles 17 + 30

Right-to-erasure SLA documented; records of processing produced automatically with sub-processor + retention disclosure.

ISO 27001 Annex A

A.5 organisational + A.8 technological controls mapped per workflow with timestamped evidence chain.

For Your Engineering Team

Measurable. Improvable. Portable.

DECKLOG is not a black-box demo. The eval harness lives in your CI. The traces live in Langfuse inside your tenant. The provider abstraction means switching from Anthropic to OpenAI to self-hosted is a config change, not a rewrite.

Eval harness in your CI

The golden question set + automated retrieval + answer evals run on every retrieval-code commit. Quality regressions get caught before they reach users. The harness is the product, not an afterthought.

Traces in Langfuse

Every query: which documents were retrieved, what the model decided, what the user received, how long it took, what it cost. Langfuse runs self-hosted in your tenant; no third-party trace store.

Clean exit kit

The vector store + the index + the configuration + the eval datasets all live in your cloud account from day one. Cancel the SaaS subscription; the deployment keeps running. No vendor-lock-in clauses; no hostage data.

Existing customer · Open portal Talk to engineering

Buyer Questions

The questions knowledge-management leads ask before purchase.

Do our documents leave the perimeter?

Only if you let them. Default deployment runs in your tenant: Azure OpenAI EU region or AWS Bedrock EU region for generation; vector store inside your cloud account; ingestion pipelines reading from your source systems via OAuth. No third-party data plane. Sovereign tier deploys self-hosted open-weights models (Llama 3.1 70B FP8 or Qwen 2.5 72B) on your bare-metal infrastructure, eliminating the cloud-provider hop entirely.

How does DECKLOG handle permissions on confidential documents?

Source-of-truth permission inheritance at ingest + re-validation at query time. When a document is indexed, DECKLOG records the principals that could read it. When a user queries, DECKLOG expands their current group memberships from your IdP (not from the stored permissions) and filters retrieval results before reranking. A document deleted or re-permissioned in SharePoint stops appearing in answers within minutes via webhook invalidation. Sensitivity labels (Microsoft Purview) are honoured throughout the pipeline.

What stops the model from hallucinating company facts?

Four defenses combined. (1) Confidence-thresholded refusal — below threshold the agent refuses rather than fabricates. (2) Citation enforcement — every claim must link to a real chunk ID retrieved during the query; fabricated citations fail server-side validation and the answer is regenerated with an explicit citation-required prompt. (3) Faithfulness scoring on sampled responses via the evaluation harness. (4) User-feedback loop converting negative ratings into new golden eval cases. Combined, these reduce production hallucination rates from typical 12-15% (unmitigated RAG) to under 2% in our deployments.

How does DECKLOG compare with Microsoft Copilot or Google Gemini for Workspace?

Different tools for different problems. Copilot and Gemini are general-purpose assistants embedded in productivity apps; they excel at writing, summarising, and tasks where the answer is generative. DECKLOG is a domain-specific retrieval system: the answer comes from your corpus with verifiable citations. They are complementary, not competitive. Many of our clients deploy both — Copilot for productivity, DECKLOG for "what is our actual policy on X" questions where citation-backed correctness matters more than generative fluency.

How long does it take to deploy?

Starter: 4-6 weeks from contract to first production query. Operate: 8-10 weeks for the multi-source-system rollout + quality dashboard. Sovereign: 12-16 weeks including self-hosted model deployment + tabletop exercises + custom Copilot Studio / Claude Project work. The bespoke DECKLOG Implementation service is the engagement shape; this product page describes the productised SaaS that wraps the same engine.

What about updates to the corpus? Will the index drift away from reality?

Webhook-driven invalidation handles the majority case: SharePoint / Drive / Confluence change events update the vector store within minutes. Nightly reconciliation walks the source systems vs the vector store side-by-side and flags drift above threshold. Combined, these catch the failure modes that pure webhook approaches miss (failed delivery, missed events, ACL changes the source system did not announce). The Operate + Sovereign tiers include drift-rate monitoring as a tracked KPI.

What evaluation metrics does DECKLOG track?

Five categories. (1) Retrieval: Recall@K and nDCG@K against the golden set. (2) Generation: faithfulness, completeness, groundedness via LLM-judged scoring. (3) End-to-end quality: human-validated sample of answers per quarter. (4) User signal: thumbs-up/down + free-text feedback with negative samples reviewed weekly. (5) Operations: median latency, p95 latency, cost per query, error rate. Dashboards integrate with your existing Grafana / Datadog / Sentry stacks.

What happens if we want to leave?

The vector store, the index, the configuration, and the trace history are all in your cloud account from day one. The provider abstraction layer means you can switch generation providers (Anthropic ↔ OpenAI ↔ self-hosted) without a rebuild. If you cancel, the integration code remains yours under a permissive internal-use license. The corpus stays where it always was — in your source systems. See DECKLOG Implementation for the bespoke variant if you want the deployment owned outright rather than consumed as SaaS.

Stop losing knowledge in the corpus. Find it. With citations.

30-minute discovery call. We map your knowledge corpus + tell you whether DECKLOG is the right shape for your top three use cases. No obligation. No high-pressure pitch.

Book discovery call Implementation service

Prefer written scope first? Email us

Product · Private RAG for Corporate Knowledge· v2 · Production

DECKLOG.
Your knowledge. Your perimeter. Your answer.

See tiers + pricing Talk to engineering

EU Data Boundary verifiedSource-system ACL enforced at query timeGDPR / DORA / NIS2 evidence-ready

Documents leaving your tenant — RAG runs in your perimeter

100%

Answers grounded with verifiable source citations

<3 sec

Median answer latency on production-tier deployments

Quarterly

Evaluation refresh catches regressions before users do

The bleed

Your knowledge is the asset. Your search is the failure mode.

Knowledge scattered across six systems nobody searches

Public LLMs leak your data and hallucinate your facts

Bolt-on AI that retrieves the wrong document with confidence

Architecture · Every Layer

What ships inside DECKLOG.

Layer 01

Source ingestion + ACL inheritance

Microsoft 365: SharePoint Online, OneDrive, Teams, Loop, Exchange — ACL inherited from source at ingest
Google Workspace: Drive, Docs, Sheets, Slides, Sites with Drive sharing-level enforcement
Confluence Cloud + Data Center + Notion + Slite + Slab with workspace-permission mapping
ServiceNow knowledge base + Zendesk Help Center + GitHub Wiki + GitLab Wiki
Slack + Teams chat history (ACL-aware, selective inclusion per channel policy)
Custom: PDF corpora, internal CMS exports, S3-resident document archives

Layer 02

Retrieval pipeline

Hybrid retrieval (BM25 + dense vector) — catches both semantic similarity + exact-identifier lookups
Reranking with Cohere Rerank 3 (managed) or bge-reranker-v2-m3 (self-hosted) — 95% quality parity
Per-query ACL re-validation: group memberships expanded at query time, never trust stored permissions alone
Sensitivity-label honouring (Microsoft Purview integration where licensed)
Time-aware retrieval: newer documents weighted; deprecated documents excluded
Multi-hop retrieval for questions that span multiple documents (Sovereign tier)

Layer 03

Generation + grounding

Generation: Claude Sonnet 4.6 (Bedrock EU) or GPT-4o (Azure OpenAI EU) or self-hosted Llama 3.1 70B FP8
Citation enforcement: every claim links to a retrieved chunk ID; fabricated citations fail validation server-side
Confidence-thresholded refusal: below threshold the agent refuses rather than fabricates
Prompt caching on the system prompt + retrieved context: 70-90% cache hit on multi-turn conversations
Provider abstraction: switching from Anthropic to OpenAI to self-hosted is a config change, not a rewrite
Per-query cost + latency telemetry tied back to the source documents that drove the answer

Layer 04

Evaluation harness

Golden question set built collaboratively with subject-matter experts in the first 4 weeks
Automated retrieval evals (Recall@K, nDCG@K) run on every commit to retrieval code
Automated answer evals (LLM-judged faithfulness, groundedness, completeness) on the golden set
User-feedback loop (thumbs + free-text) sampled into review queue for engineering attention
Regression detection: a model upgrade that drops eval score above tolerance cannot be merged
Quarterly eval refresh: new golden questions, retired stale ones, scored against current performance

Layer 05

Deployment + observability

Deployment surfaces: Microsoft Teams app, Slack bot, web UI hosted in your tenant, browser extension, API access
Langfuse self-hosted in your tenant for trace storage + cost analysis + eval datasets
Vector store options: pgvector (Postgres-native), Qdrant (managed or self-hosted), Pinecone (managed)
Webhook-driven invalidation: SharePoint / Drive / Confluence changes propagate within minutes
Nightly reconciliation: walks source systems vs vector store; flags drift above threshold
GDPR Article 17 right-to-erasure honoured with documented deletion SLA

Engagement Tiers · Pick your depth

Start with one team. Operate across the corpus. Scale to sovereign.

Three productised tiers. Same retrieval pipeline, same evaluation harness. The difference is breadth of source systems + retention + compliance grade.

Tier

Starter

from €1,990 / monthPer tenant + storage tier

Best for

Single-team pilot (HR, Legal, Ops, or Finance) over a focused corpus of <10k documents from one source system.

One source-system connector (SharePoint OR Drive OR Confluence OR Notion)
Up to 10,000 documents indexed
Hybrid retrieval + reranking
Deployment to Teams OR Slack OR web UI
Golden question set with 50 SME-validated questions
30-day post-deploy support window

Outcome

Single team finds answers in seconds; "where is X policy" Slack questions to senior staff drop 60-70%.

Request Starter

Operate

from €4,990 / monthPer tenant + storage tier

Best for

Multi-team operations across HR + Legal + Ops + Finance, multiple source systems, with monthly content growth.

Up to 4 source-system connectors
Up to 50,000 documents indexed
Sensitivity-label honouring (Microsoft Purview integration)
Multi-surface deployment (Teams + Slack + web UI)
Monthly quality dashboard with user-feedback analysis
Quarterly evaluation refresh with new golden questions
Named AI engineer + same-day triage on quality issues

Outcome

Cross-functional knowledge layer; new-hire ramp shortens 3-4x; senior staff stop being interruption-driven.

Request Operate

Tier

Sovereign

from €9,990 / monthPer tenant + storage tier

Best for

Regulated entities (DORA / NIS2 financial services, healthcare, legal practices, public sector) requiring audit-grade RAG with EU sovereignty and 7-year retention.

Everything in Operate
EU Data Boundary verification across model + storage tiers
Self-hosted open-weights option (Llama 3.1 70B FP8 or Qwen 2.5 72B on your infra)
Per-query audit log with regulatory retention (7 years configurable)
Multi-hop retrieval for cross-document regulatory analysis
Custom Copilot Studio / Claude Project for specific business processes
Tabletop exercises for AI-incident response
Dedicated AI Architect + monthly executive review

Outcome

Audit-grade RAG with full evidence chain; defensible posture under DORA + NIS2 supervisory review.

Request Sovereign

Looking for a bespoke implementation owned outright instead of consumed as SaaS? DECKLOG Implementation · from €4,990

Native Connectors

Every source system DECKLOG already speaks.

Microsoft 365, Google Workspace, Atlassian, Notion, ServiceNow, GitHub — the corpus surfaces where modern enterprises actually store knowledge. ACL-aware, webhook-driven, reconciled nightly.

Microsoft SharePoint Online

Native Graph API + sharing-permission inheritance + sensitivity-label honouring

Microsoft OneDrive + Teams + Loop

Personal + shared + collaborative-canvas content fully covered

Google Workspace Drive + Docs

Drive sharing model + Workspace permission groups respected at query time

Confluence Cloud + Data Center

Atlassian workspace permissions mapped, page-level ACL preserved

Notion

Teamspace + page-level permission inheritance with API-driven sync

ServiceNow KB + Zendesk Help Center

Knowledge-base articles with view-permission enforcement

Slack + Microsoft Teams (chat)

Selective channel ingestion with per-channel policy

GitHub + GitLab Wiki

Engineering-doc surfaces with org-membership ACL

Custom PDF + CMS exports

Bulk-ingest pipeline with manual ACL mapping for legacy archives

Azure OpenAI EU + AWS Bedrock EU

Default generation providers in EU regions with Customer Lockbox

Self-hosted Llama / Qwen / Mistral

Sovereign-tier option via vLLM runtime on your bare-metal

Langfuse (self-hosted)

Trace + eval + cost observability inside your perimeter

Where DECKLOG runs

Your tenant. Your region. Your choice of generation provider.

Managed EU (default)

Hybrid sovereign

Full sovereign (self-hosted)

EU Regulatory Alignment

DORA. NIS2. GDPR. ISO 27001. Evidence-ready by design.

DORA Article 28

Third-party ICT-risk evidence: model provider, hosting region, sub-processors, exit strategy, concentration analysis.

NIS2 Article 21

Technical baseline: PII redaction at boundary, MFA-bound query surface, audit logging, incident-response integration.

GDPR Articles 17 + 30

Right-to-erasure SLA documented; records of processing produced automatically with sub-processor + retention disclosure.

ISO 27001 Annex A

A.5 organisational + A.8 technological controls mapped per workflow with timestamped evidence chain.

For Your Engineering Team

Measurable. Improvable. Portable.

Eval harness in your CI

Traces in Langfuse

Every query: which documents were retrieved, what the model decided, what the user received, how long it took, what it cost. Langfuse runs self-hosted in your tenant; no third-party trace store.

Clean exit kit

Existing customer · Open portal Talk to engineering

Buyer Questions

The questions knowledge-management leads ask before purchase.

Do our documents leave the perimeter?

How does DECKLOG handle permissions on confidential documents?

What stops the model from hallucinating company facts?

How does DECKLOG compare with Microsoft Copilot or Google Gemini for Workspace?

How long does it take to deploy?

What about updates to the corpus? Will the index drift away from reality?

What evaluation metrics does DECKLOG track?

What happens if we want to leave?

Stop losing knowledge in the corpus. Find it. With citations.

30-minute discovery call. We map your knowledge corpus + tell you whether DECKLOG is the right shape for your top three use cases. No obligation. No high-pressure pitch.

Book discovery call Implementation service

Prefer written scope first? Email us