Config¶
Pipeline configuration and constants.
PipelineConfig(output_dir=Path('data'), source_cache_dir=Path('data/source_cache'), repos_dir=Path('data/repos'), github_token='', openai_api_key='', distillation_model='gpt-5.4', distillation_base_url=None, swebench_dataset='princeton-nlp/SWE-bench', splits=(lambda: ['test'])(), max_instances=None, min_tools_per_instance=3, max_tools_per_instance=7, max_tool_output_lines=MAX_TOOL_OUTPUT_LINES, distillation_max_concurrent=50, distillation_temperature=0.3, command_timeout=30)
dataclass
¶
Configuration for the data generation pipeline.
Constants¶
config
¶
Configuration for the data generation pipeline.