Skip to main content

How to create datasets

Datasets are critical assets for building robust prompts, evals, fine-tuning, and much more. Phoenix allows you to build datasets manually, programmatically, or from files.

Create datasets from CSV

Create datasets from JSONL

Create datasets from Pandas

Create datasets from spans

Create datasets using synthetic data

Exporting datasets

Export datasets for offline analysis, evals, and fine-tuning.
  • Exporting to CSV - how to quickly download a dataset to use elsewhere
  • Exporting to OpenAI Ft - want to fine tune an LLM for better accuracy and cost? Export llm examples for fine-tuning.
  • Exporting to OpenAI Evals - have some good examples to use for benchmarking of llms using OpenAI evals? export to OpenAI evals format.