Fox Software Solutions

Knowledge Base Query Report

Generated 12 March 2026
Profilerag-report
Fox Software Solutions
Document Summary

Knowledge Base Query Report

Analysis of 47 benchmark queries against a 312-document knowledge base reveals 84% semantic coverage. Six content gaps identified where queries return low-confidence results. Retrieval averaging 22ms. Recommendations to close gaps and improve chunking strategy for 3 document types.

Knowledge Base Query Report
6Sections
501Words

Table of Contents

Knowledge Base Query Report

A semantic benchmark run of 47 queries against the indexed collection has completed. The knowledge base covers 312 documents, 8,941 chunks, with retrieval averaging 22ms. Coverage is strong at 84% — but 6 query categories return low-confidence results, indicating content gaps or chunking issues.

312Documents Indexed
8,941Chunks Indexed
22msAvg Retrieval Time
84%Coverage Score

Attention Required

6 query categories returning low-confidence results. These represent either missing content or documents that are not chunking well — long tables, scanned PDFs, or heavily formatted files.

  1. Leave entitlements — 3 queries, max similarity 0.41. Content may exist but is buried in HR handbook appendices.
  2. Emergency procedures — 2 queries, max similarity 0.38. Likely in a scanned PDF not processing correctly.
  3. Contractor onboarding — 4 queries, max similarity 0.44. No dedicated document found. Gap confirmed.

Query Results by Category

HR Policies
12
Compliance & Legal
8
Finance Procedures
7
IT & Systems
6
Leave & Entitlements
5
Emergency Procedures
4
Contractor Management
5

HR, compliance, and finance show strong retrieval. Leave, emergency, and contractor categories are the problem areas.


Retrieval Performance Distribution

< 10ms
8
10–25ms
31
25–50ms
6
> 50ms
2

83% of queries complete under 25ms. The two outliers above 50ms both involve multi-document synthesis queries — expected behaviour.


Document Processing Issues

Processing Well

  • Standard Word documents (.docx)
  • Plain text policies (.txt, .md)
  • Native PDFs with selectable text
  • Structured HTML exports
  • Spreadsheets with clear headers

Processing Poorly

  • Scanned PDFs (OCR quality variable)
  • Documents with large embedded tables
  • Files with heavy header/footer repetition
  • Multi-column layouts losing reading order
  • Password-protected files (skipped entirely)

Content Gap Analysis

Critical
Contractor Onboarding

No document covers contractor onboarding process. 4 queries all return < 0.45 similarity. Document needs to be created.

Critical
Emergency Procedures

Emergency evacuation procedure exists but is a scanned PDF. Re-ingest as native PDF or Word document.

Warning
Leave Entitlements

Content exists in HR Handbook appendix (pages 34–41) but appendix is a separate embedded file. Extract and index separately.

Warning
Parental Leave Policy

Referenced in 3 other documents but no standalone policy document found.

Success
IT Security Awareness

6 months out of date. Queries return old content confidently. Flag for review.

Success
Finance Approval Thresholds

Table-heavy document chunking poorly. Split into separate documents by approval level.


Critical
Create contractor onboarding document

Draft new document covering induction, access provisioning, and compliance requirements. Index immediately.

Critical
Re-process emergency procedures PDF

Convert scanned PDF to Word, re-index. Estimated retrieval improvement from 0.38 → 0.75+.

Critical
Extract HR Handbook appendices

Split appendix sections into individual indexed documents. Improves leave query coverage significantly.

Warning
Create parental leave standalone policy

Consolidate scattered references into a single authoritative document.

Warning
Adjust chunk size for table-heavy documents

Reduce chunk size from 512 to 256 tokens for finance approval documents. Reindex these 4 files only.

Success
Schedule quarterly re-index

Set automated re-index to catch updated documents. IT Security doc currently 6 months stale.


VerdictSummary
Rating Label
strong Bottom Line

The knowledge base is performing well where content exists. The 84% coverage score reflects genuine content gaps, not system failures. Three high-priority actions — creating the contractor onboarding document, re-processing the emergency procedures PDF, and extracting HR appendices — will push coverage above 95%. The retrieval speed is already strong at 22ms average and requires no tuning. Once gaps are addressed, this collection is ready for production use.

Fox Software Solutions
Document
SubtitleRAG System Analysis & Retrieval Insights
Profilerag-report
CollectionCompany Policy & Procedures
Documents Indexed312
Chunks Indexed8941
Queries Run47
Avg Retrieval Ms22
Coverage Score84%
Gaps Identified6
Tagsrag, knowledge-base, semantic-search, document-intelligence