Publications

White papers and technical write-ups from Xtensyon Labs. Practical notes on building production AI, governance, evaluation, and the unglamorous parts that make systems work.

2026

February 10, 2026 • Case Study • 9 min read

Xtensyon Labs, Platform Engineering

Deploying LLMs On-Prem and in Private Cloud: Cost, Compliance, and Reliability Considerations

Some teams cannot send sensitive data to third-party APIs, and network routes are not always predictable. This case study shares a private deployment pattern aimed at strong controls, clear cost reporting, and operational reliability.

on-prem llm, private cloud, data residency, enterprise security, GPU cost optimization, high availability

February 5, 2026 • Technical Brief • 7 min read

Xtensyon Labs, Applied AI Team

Enterprise Evaluations for RAG: Scorecards, Golden Sets, and Release Gates

Most RAG projects fail quietly: the demo works, then accuracy drifts after a few indexing tweaks. This brief introduces a scorecard approach with simple metrics, a golden query set, and release gates to keep enterprise RAG systems stable in production.

RAG evaluation, LLM testing, golden dataset, prompt regression, AI quality assurance, hallucination detection

January 28, 2026 • White Paper • 8 min read

Xtensyon Labs

Production RAG Governance: A Practical Playbook for Enterprises

RAG can make LLM outputs auditable and grounded, but only if retrieval quality, policy controls, and monitoring are treated as first-class production concerns. This playbook outlines a governance pattern teams can adopt in weeks, not quarters.

retrieval-augmented generation, RAG governance, LLM observability, AI risk management, enterprise AI, model monitoring

2025

December 19, 2025 • Technical Brief • 7 min read

Xtensyon Labs

RAG Latency Budgeting: Hitting Response Targets Without Cutting Corners

RAG pipelines get slow for predictable reasons: parsing, retrieval, reranking, and long generations. This brief shows how to budget latency across steps and choose optimizations that do not reduce trust.

latency, rag, performance, observability, caching, reranking

December 3, 2025 • White Paper • 9 min read

Xtensyon Labs

Identity Sync for Permission-Aware Retrieval: Getting ACLs Right in Practice

Permission-aware retrieval fails when group memberships drift or metadata is inconsistent. This paper covers a practical identity sync and ACL strategy that keeps retrieval correct without slowing teams down.

access control, acl, identity sync, rag, security, enterprise search

November 6, 2025 • Technical Brief • 7 min read

Xtensyon Labs

Building Evaluation Sets from Real Tickets: A Practical Alternative to Benchmarks

Public benchmarks rarely match internal workflows. This brief shows how to turn a set of real tickets into a repeatable evaluation suite that catches regressions after prompt or indexing changes.

evaluation, llm testing, support tickets, quality, rag, regression

October 14, 2025 • Technical Brief • 6 min read

Xtensyon Labs

Document Versioning for Policy Teams: Preventing Wrong Answers from Old Guidelines

Policy teams update documents often, but users keep bookmarking the old files. This brief shows a versioning and canonical link approach that reduces confusion and improves citations in RAG systems.

document versioning, policy, governance, RAG, compliance, knowledge management

September 9, 2025 • Case Study • 9 min read

Xtensyon Labs

SAP Automation with LLM Guardrails: Reducing Rework in Ticket-Based Operations

We share a pattern for automating repetitive SAP operations requests without letting the model act freely. The focus is on approvals, safe tool scopes, and clear fallbacks when data is incomplete.

sap, automation, llm, guardrails, operations, workflow

August 26, 2025 • Technical Brief • 7 min read

Xtensyon Labs

Multilingual Enterprise Search: Handling Mixed-Language Queries with Consistent Relevance

Search quality drops when users mix languages, abbreviations, and technical terms in one query. This brief covers indexing, normalization, and evaluation methods that improve recall without punishing precise keyword searches.

multilingual search, mixed-language queries, embeddings, information retrieval, enterprise search, evaluation

July 3, 2025 • White Paper • 10 min read

Xtensyon Labs

Prompt Injection in Enterprise Systems: Threats, Tests, and Practical Defenses

Prompt injection is not a theoretical problem: it shows up through emails, PDFs, tickets, and chat logs. This paper lays out a hardening checklist that security teams can validate and engineers can ship.

prompt injection, security, llm appsec, rag, threat modeling, policy
