Understanding Trainings
Training in Chatmode uses vector embeddings and retrieval-augmented generation (RAG) to improve your agent’s responses:- Your training data is processed and converted into vector embeddings
- When a user asks a question, the system searches for relevant information in your training data
- The agent uses this retrieved information to generate more accurate and informed responses
Types of Training Data
Chatmode supports the following types of training data sources:Direct Text Input (TEXT
)
Manually enter or paste text content directly, such as:
- Product descriptions
- FAQs
- Company policies
- Troubleshooting guides
- Content extracted from documents (PDF, Word, etc.) manually or via other tools.
Website URL (URL
)
Provide a single URL to a specific web page. The system will attempt to extract the main textual content from that page for training.
Google Shopping XML (GOOGLE_SHOPPING_XML
)
Provide a URL to your Google Shopping XML feed. The system will extract product information from the feed and use it for training. This type is optimized for product catalogs and uses item-level chunking for better indexing.
Currently, direct file uploads (PDF, DOCX) and full website crawling are not
supported within the training creation process itself. Content from files
needs to be extracted and provided as text.
Creating a New Training
To add training data:- Navigate to the Trainings section in the sidebar
- Click the New Training button
- Give your training a descriptive Name.
- Select the Data Source Type (
TEXT
orURL
orGOOGLE_SHOPPING_XML
). - Provide the Content (either paste text or enter a URL).
- Select the API Key (e.g., OpenAI) to use for generating embeddings for this training data.
- Configure processing options (see below).
- Click Create Training to start the asynchronous processing.
Training Configuration
When creating a training, you can configure:Content Processing
- Chunk Length: The approximate maximum size (in characters or tokens) of each piece the content is divided into for embedding. Default: 1000.
- Chunk Overlap: How much content (in characters or tokens) should overlap between consecutive chunks to maintain context. Default: 200.
Managing Trainings
Training Status
After initiating a training, you can monitor its status in the training list:PENDING
: Training is queued for processing.PROCESSING
: Content is being extracted and embeddings are being generated.COMPLETED
: Training is fully processed and ready to be associated with agents.FAILED
: Processing encountered an error. Check logs or try again.
Training Details
View basic information about your trainings in the list:- Name
- Type (
TEXT
,URL
, orGOOGLE_SHOPPING_XML
) - Status
- Creation Date
Training Best Practices
For optimal results with training:- Focus on Quality: Provide accurate, well-structured information
- Be Comprehensive: Cover common questions and scenarios
- Update Regularly: Keep your training data current as information changes
- Structure Appropriately: Organize content logically with clear headings
- Test Thoroughly: Verify improvements with realistic queries after training
Content Guidelines
- Break information into clear sections with descriptive headings
- Include common questions and their answers
- Provide specific examples and use cases
- Use consistent terminology throughout
- Avoid heavily formatted content when possible
Example Training Workflows
Product Knowledge Base
- Upload product manuals, specifications, and FAQs
- Add website URLs for product pages
- Include common customer questions and appropriate answers
- Train your agent to handle product inquiries
Internal Policies
- Upload company policy documents
- Add procedure guides and employee handbooks
- Include common HR questions and appropriate responses
- Train your agent to assist employees with policy questions
Evaluating Training Effectiveness
After adding training, evaluate its impact:- Test Queries: Try asking questions that should be answered with your training data
- Review Conversations: Examine real user interactions to identify knowledge gaps
- Track Metrics (if applicable): Monitor changes in resolution rates and user satisfaction
- Compare Responses: Note differences in agent responses before and after training
Troubleshooting
Common issues and solutions:Issue | Possible Solutions |
---|---|
Agent not using training data | Check if training is COMPLETED and associated with the correct agent(s) |
Incorrect information in responses | Review the original source content used for the specific training |
Slow processing | Training is processed asynchronously. Large inputs may take time. |
Training failed | Check the validity of the URL (if used) or the API Key selected. Simplify content and retry. |
Next Steps
After setting up trainings:- Test your agent with questions related to your training content
- Monitor conversations to evaluate training effectiveness
- Update your agent’s prompt to leverage the training data effectively