Document Guidelines
The content and structure of a document are key factors in retrieving accurate answers to queries.
1. Document Content
Composition
- Each document is a logical collection of attachments (files).
Supported Formats
- PDF (must meet accessibility standards)
- Text (.txt)
Language
- Only English is supported.
Size & Limits
- Each attachment: up to 300 pages
- Each document: up to 500 attachments
- Total size across all attachments: ≤ 2 GB
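The limits above can be checked before upload. The following is a minimal sketch, not part of any official tooling: the `Attachment` record and the `check_document` function are hypothetical names, page counts and sizes are assumed to be supplied by the caller, and "2 GB" is assumed to mean 2 GiB.

```python
from dataclasses import dataclass

# Limits taken from the Size & Limits list.
MAX_PAGES_PER_ATTACHMENT = 300
MAX_ATTACHMENTS_PER_DOCUMENT = 500
# Assumption: "2 GB" is read as 2 GiB; use 2 * 10**9 if decimal gigabytes are meant.
MAX_TOTAL_BYTES = 2 * 1024**3


@dataclass
class Attachment:
    """Hypothetical record describing one file in a document."""
    name: str
    pages: int
    size_bytes: int


def check_document(attachments):
    """Return a list of limit violations; an empty list means all limits are met."""
    problems = []
    if len(attachments) > MAX_ATTACHMENTS_PER_DOCUMENT:
        problems.append(
            f"{len(attachments)} attachments exceeds the limit of "
            f"{MAX_ATTACHMENTS_PER_DOCUMENT}"
        )
    for att in attachments:
        if att.pages > MAX_PAGES_PER_ATTACHMENT:
            problems.append(
                f"{att.name}: {att.pages} pages exceeds the limit of "
                f"{MAX_PAGES_PER_ATTACHMENT}"
            )
    total = sum(att.size_bytes for att in attachments)
    if total > MAX_TOTAL_BYTES:
        problems.append(f"total size {total} bytes exceeds {MAX_TOTAL_BYTES} bytes")
    return problems
```

Returning a list of messages rather than raising on the first failure lets an author see every violation in one pass.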
Limitations
- Multi-modal questions are not supported:
  - Only questions about text-based content will be answered.
  - Questions about pictures, graphics, charts, etc. will not be answered.
- Complex tables spanning multiple pages are not supported.
2. Document Structure
Importance
A clear structure allows the agent to split the document into semantically meaningful chunks, improving the quality of answers.
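To illustrate why structure matters, here is a rough sketch of heading-based chunking. It is not the agent's actual algorithm, which the guidelines do not describe. It assumes, purely for illustration, a plain-text input in which heading lines start with `#`; the function name `split_by_headings` is hypothetical.

```python
def split_by_headings(text):
    """Split text into chunks, one per heading, keeping each heading with its body."""
    chunks = []
    heading, body = "", []

    def flush():
        # Store the chunk collected so far, skipping a completely empty one.
        if heading or body:
            chunks.append({"heading": heading, "text": " ".join(body)})

    for line in text.splitlines():
        if line.startswith("#"):  # assumed heading marker, for illustration only
            flush()
            heading, body = line.lstrip("#").strip(), []
        elif line.strip():
            body.append(line.strip())
    flush()
    return chunks


sample = (
    "# Refunds\n"
    "Refunds take 5 days.\n"
    "# Shipping\n"
    "We ship worldwide.\n"
    "Tracking is included.\n"
)
for chunk in split_by_headings(sample):
    print(chunk["heading"], "->", chunk["text"])
```

With descriptive headings, each chunk carries its own topic label; with missing or vague headings, unrelated sentences end up in the same chunk.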
Best Practices
- Use headings and subheadings to organize content.
- Headings must clearly describe the meaning of their section.
- Each section should contain related concepts only.
  - If multiple concepts are mixed, split into new sections.
- Ensure heading levels are visually distinguishable.
Example of Heading Levels
- Heading 1 → Font size 12
- Heading 2 → Font size 10
- Heading 3 → Font size 8
- (and so on…)
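One way to sanity-check a style sheet against this example is to confirm that font sizes shrink at every level. The sketch below is an assumption-laden illustration: the function name and the `min_gap` threshold are hypothetical, and the guidelines do not state how large a difference counts as "visually distinguishable".

```python
def heading_sizes_are_distinct(sizes_by_level, min_gap=1.0):
    """Return True if each heading level is at least min_gap smaller than the one above it.

    sizes_by_level maps heading level (1, 2, 3, ...) to font size.
    min_gap is an assumed threshold, not a value from the guidelines.
    """
    ordered = [sizes_by_level[level] for level in sorted(sizes_by_level)]
    return all(
        upper - lower >= min_gap for upper, lower in zip(ordered, ordered[1:])
    )


# The sizes from the example above pass the check.
print(heading_sizes_are_distinct({1: 12, 2: 10, 3: 8}))
# Two levels sharing a size do not.
print(heading_sizes_are_distinct({1: 12, 2: 12, 3: 8}))
```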