Skip to main content
All CollectionsUnderstanding DashboardTrain Your AI PersonalityTrain By Private Data
Crafting Your AI Personality: Creating the Perfect Training Data File
Crafting Your AI Personality: Creating the Perfect Training Data File

This article helps the users to create a well-thought-out information file for training their AI personality or character with private data

Kanishka Sharma avatar
Written by Kanishka Sharma
Updated over a week ago

Welcome to Kamoto.AI's world of limitless customization, where you have the power to shape your AI Personality's responses like never before. The "Train by Private Data" feature lets you train your AI with exclusive, private information, ensuring a unique and tailored conversational experience. In this guide, we'll walk you through the process of creating the perfect training data file that will empower your AI Personality to shine.

The Significance of Quality Training Data in AI

Training your AI Personality with private data is all about enriching its knowledge and responses. It's a chance to infuse your AI with specific insights, making it even more valuable and engaging for your users.

The quality and structure of your Training Data File are vital in shaping the behavior and responses of your AI Personality. This principle holds true for all AI and Language Learning Models (LLMs). Industry experts often say that 80-90% of an AI's effectiveness hinges on the quality and structure of its training data.

Therefore, please carefully adhere to the following simple guidelines to ensure optimal training data quality. Since the process is fully automated, Kamoto.AI's role in enhancing training is minimal. The key to effective AI training lies in the quality of your training data files. Your input significantly shapes the performance and behavior of the AI personality.

Here's how to do it right:

File Format

  • Your training data file must be in plain text format, with the file extension .txt or .text. No other formats, including .rtf, will be accepted.

Paragraph Breakdown

  • Break your content into paragraphs, ideally containing 5 to 500 words each. While it's possible to exceed 500 words in each paragraph, we generally do not recommend doing so for optimal results. This segmentation allows your AI to process information more effectively.

Content Quality

  • Each paragraph should deliver a substantial knowledge chunk for the AI to consider while answering relevant questions. Quality matters more than quantity.

Separation with Empty Lines

  • Crucially, each paragraph must be separated by an empty line. This clear division is essential for the AI to recognize the start of a new piece of information.

Content Variety

  • Your paragraphs can take various forms, such as simple text, question and answer pairs, or prompts and responses. Feel free to mix it up to achieve the desired conversational depth.
    ​

  • Example of a simple paragraph of knowledge

    In the heart of the enchanted forest lived Elara, a young witch with an affinity for nature. She had a rare gift to communicate with animals, which made her a trusted friend among the creatures of the woods.

  • Example of a question and answer-based paragraph

    Question: How did Elara discover her ability? 
    Answer: Elara's grandmother, a wise old witch, recognized her unique connection with animals from a young age. She guided Elara in nurturing this gift, teaching her the secrets of animal speech.

  • Example of a simple prompt-response-based paragraph

    Prompt: Describe the most exotic flower in Elara's garden. 
    Response: The Lumina Orchid, with petals that glowed in the moonlight, was the most coveted flower. Legends claimed that its nectar held the secret to eternal youth.

File Size

  • Keep the total size of your training data file under 2 MB. This ensures efficient processing and optimal performance.

Language Choice

  • For the best outcomes, it's advisable to use English in your training data. The AI's proficiency in generating responses is highest in English. Your AI Personality or Character is designed to respond primarily in the language in which questions are posed, irrespective of the language used in the training data.

Quantity of Paragraphs

  • There's no limit to the number of paragraphs you can include in your file, as long as you stay within the 2 MB size limit. Feel free to pack it with valuable insights.

Focus on Knowledge, Not Grammar

  • Emphasize the quality of the training data's content rather than its grammar or language intricacies. The AI learns from the knowledge you provide.

Sample Data File for Training

  • The user may download the sample file crafted to deliver a glimpse of the standard document needed to be submitted in order to train the AI personality perfectly -
    ​

Why Quality Training Data Matters

  • Customization: Crafting a well-structured training data file allows you to tailor your AI's responses to your exact requirements.

  • Engagement: Quality data enhances the AI's ability to engage users effectively, making their interactions more meaningful.

  • Uniqueness: With private data, your AI Personality stands out as a one-of-a-kind conversational partner, offering exclusive insights and responses.

Creating the perfect training data file for your AI Personality is an art that you can master. The quality and structure of your data directly impact the AI's performance, making your interactions with users more impactful and memorable.

Get started today, and watch your AI Personality come to life with the unique knowledge and insights you provide. Your efforts in crafting quality training data will elevate your AI's conversational prowess and create a more engaging and personalized experience for all who interact with it.


​

Did this answer your question?