Data Studio
Uploading files to the Data Studio lets users create datasets that serve as memory for contextualizing their agents' workflows and that can be shared with the community.
To create a dataset in the Data Studio, users should:
Access the Data Studio: Navigate to the Data Studio interface.
Name the Dataset: Provide a name for the dataset and add a short description.
Upload Files: Choose the desired files from local storage. The platform supports bulk uploads and drag-and-drop functionality.
Include Scraped Internet Data: Select this option if desired. Users can also enter the URLs of specific sources to scrape.
Create the Dataset: Once the user clicks "Create," the system generates a dataset along with an associated vector knowledge graph, which agents can then draw on.
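The form fields described in the steps above can be pictured as a small data structure. The following is a minimal sketch, not the Data Studio's actual API: the class `DatasetRequest`, its field names, and the readiness rules in `problems()` are all hypothetical and only mirror the steps listed above.

```python
from dataclasses import dataclass, field
from pathlib import Path
from typing import List
from urllib.parse import urlparse


@dataclass
class DatasetRequest:
    """Hypothetical container mirroring the fields of the creation form."""
    name: str
    description: str
    files: List[Path] = field(default_factory=list)
    include_scraped_data: bool = False
    source_urls: List[str] = field(default_factory=list)

    def problems(self) -> List[str]:
        """Return human-readable problems; an empty list means ready to create.

        These rules are assumptions made for illustration, not documented
        platform behavior.
        """
        issues = []
        if not self.name.strip():
            issues.append("dataset name is required")
        if not self.files and not self.include_scraped_data:
            issues.append("add at least one file or enable scraped data")
        if self.source_urls and not self.include_scraped_data:
            issues.append("source URLs require the scraped-data option")
        for url in self.source_urls:
            parsed = urlparse(url)
            if parsed.scheme not in ("http", "https") or not parsed.netloc:
                issues.append(f"not a valid http(s) URL: {url}")
        return issues
```

Checking a request before clicking "Create" would then amount to confirming that `problems()` returns an empty list.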
File Formats Supported
The Data Hub supports a wide range of file formats, including PDF, DOC, XLSX, CSV, JSON, TXT, XML, and more, ensuring broad compatibility with various data types and sources.
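Before a bulk upload, it can help to sort local files by whether their extension appears in the list above. This is a small sketch under stated assumptions: the helper name `split_by_listed_format` is hypothetical, and because the list ends with "and more", a file in the second group is not necessarily unsupported.

```python
from pathlib import Path

# Extensions named in the text above. The formats covered by "and more"
# are not known, so they are deliberately left out.
LISTED_EXTENSIONS = {".pdf", ".doc", ".xlsx", ".csv", ".json", ".txt", ".xml"}


def split_by_listed_format(paths):
    """Split paths into (listed, unlisted) by case-insensitive extension."""
    listed, unlisted = [], []
    for p in map(Path, paths):
        target = listed if p.suffix.lower() in LISTED_EXTENSIONS else unlisted
        target.append(p)
    return listed, unlisted
```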
Metadata and Categorization
Adding Metadata: Users can add metadata to describe the contents of their files. Metadata fields include title, description, keywords, source information, and date of creation. This information helps in identifying and categorizing the data for future retrieval.
Categorization: Uploaded files can be categorized by content, for example by topic, industry, or data type. Categorization keeps the Data Hub organized and makes datasets easier for other users to find and use.
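The metadata fields and categories described above map naturally onto a simple record. The sketch below is illustrative only: the class `FileMetadata`, its attribute names, and the JSON layout are assumptions, not the platform's schema.

```python
import json
from dataclasses import asdict, dataclass, field
from datetime import date
from typing import Dict, List, Optional


@dataclass
class FileMetadata:
    """Hypothetical record holding the metadata fields listed above."""
    title: str
    description: str = ""
    keywords: List[str] = field(default_factory=list)
    source: str = ""
    created: Optional[date] = None
    # Assumed category keys, e.g. "topic", "industry", "data_type".
    categories: Dict[str, str] = field(default_factory=dict)

    def to_json(self) -> str:
        """Serialize the record, writing the creation date in ISO 8601 form."""
        record = asdict(self)
        record["created"] = self.created.isoformat() if self.created else None
        return json.dumps(record, indent=2)
```

Keeping categories as key-value pairs, rather than a single label, lets one file be filed under a topic, an industry, and a data type at the same time.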
Post-Upload Processing
Data Validation: Once uploaded, files are validated for errors and inconsistencies. Validation includes format checks, data integrity checks, and a check that the data is consistent with the metadata supplied for it.
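To make the three kinds of checks concrete, here is a rough sketch of what validating a single text-based file might look like. It is not the platform's validator: the function `validate_upload`, the `declared_type` parameter, and the specific rules are hypothetical, and only JSON and CSV content is parsed.

```python
import csv
import io
import json
from typing import List


def validate_upload(filename: str, content: bytes, declared_type: str) -> List[str]:
    """Return validation errors for one file; an empty list means it passed."""
    errors = []
    ext = filename.rsplit(".", 1)[-1].lower() if "." in filename else ""

    # Metadata check (assumed rule): declared data type matches the extension.
    if ext != declared_type.lower():
        errors.append(f"metadata says {declared_type!r} but file extension is {ext!r}")

    # Integrity check: an empty payload cannot be processed.
    if not content:
        return errors + ["file is empty"]

    # Only JSON and CSV are parsed in this sketch; other formats would
    # need their own parsers.
    if ext not in ("json", "csv"):
        return errors
    try:
        text = content.decode("utf-8")
    except UnicodeDecodeError:
        return errors + ["file is not valid UTF-8"]

    # Format check: the content must parse as the format its extension claims.
    if ext == "json":
        try:
            json.loads(text)
        except json.JSONDecodeError as exc:
            errors.append(f"invalid JSON: {exc.msg}")
    else:
        rows = list(csv.reader(io.StringIO(text)))
        widths = {len(row) for row in rows if row}
        if len(widths) > 1:
            errors.append("CSV rows have inconsistent column counts")
    return errors
```

Returning a list of messages instead of raising on the first failure lets an uploader see every problem with a file in one pass.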
Integration with Knowledge Graphs: Validated data is then integrated into the vector knowledge graphs, making it accessible for AI agents and workflows. This process involves mapping the data to relevant entities and relationships within the graph.
User Permissions and Access Control
Setting Permissions: Users can set permissions for their uploaded files, determining who can view or edit the data. Options include public access, restricted access to specific users or groups, and private access.
Access Control: The Data Hub enforces strict access control policies to ensure data security and privacy. Users can manage access rights and monitor who has accessed their data.
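The three permission levels and the access monitoring described above can be modeled in a few lines. This is a sketch under assumptions, not the Data Hub's implementation: `Visibility`, `FilePermissions`, and the rule that editors may edit any non-private file are hypothetical, and group membership is left out for brevity.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import List, Set, Tuple


class Visibility(Enum):
    PUBLIC = "public"
    RESTRICTED = "restricted"
    PRIVATE = "private"


@dataclass
class FilePermissions:
    """Hypothetical per-file permissions with a simple access log."""
    owner: str
    visibility: Visibility = Visibility.PRIVATE
    viewers: Set[str] = field(default_factory=set)
    editors: Set[str] = field(default_factory=set)
    # Each entry is (user, action, allowed), so owners can review access.
    access_log: List[Tuple[str, str, bool]] = field(default_factory=list)

    def can_view(self, user: str) -> bool:
        if user == self.owner or self.visibility is Visibility.PUBLIC:
            allowed = True
        elif self.visibility is Visibility.RESTRICTED:
            allowed = user in self.viewers or user in self.editors
        else:  # PRIVATE: owner only
            allowed = False
        self.access_log.append((user, "view", allowed))
        return allowed

    def can_edit(self, user: str) -> bool:
        # Assumption: named editors may edit unless the file is private.
        allowed = user == self.owner or (
            self.visibility is not Visibility.PRIVATE and user in self.editors
        )
        self.access_log.append((user, "edit", allowed))
        return allowed
```

Logging denied attempts alongside granted ones is a deliberate choice here, since a record of who tried to reach a file is often as useful as a record of who succeeded.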