Poetiq: SOTA Reasoning on ARC-AGI

This repository allows reproduction of Poetiq's record-breaking submission to the ARC-AGI-1 and ARC-AGI-2 benchmarks.

Full analysis is available in our launch post, Traversing the Frontier of Superintelligence.

Our method is now on top of the official leaderboard. More information is in our follow-up post, Poetiq Shatters ARC-AGI-2 State of the Art at Half the Cost.

📊 Public Eval Results

You can recreate the Gemini 3 points from these charts using this repo.

These are our results on the official ARC Prize leaderboard, which is evaluated on problems that are kept private.

Set up the environment:

```
python -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
```
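
To confirm that the virtual environment is active before installing or running anything, you can compare `sys.prefix` with `sys.base_prefix`. This is a minimal sketch and not part of the repository; the function name `in_virtualenv` is hypothetical.

```python
import sys


def in_virtualenv() -> bool:
    """Return True when the interpreter is running inside a venv."""
    # Inside an environment created by `python -m venv`, sys.prefix points at
    # the environment while sys.base_prefix points at the base installation.
    return sys.prefix != sys.base_prefix


if __name__ == "__main__":
    print("virtual environment active:", in_virtualenv())
```

If this reports `False`, re-run the `source .venv/bin/activate` step in the same shell.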

Create a .env file in the root directory. You must include keys for the models you intend to run.
```
GEMINI_API_KEY=...
OPENAI_API_KEY=...
```
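
Before starting a long run, it can help to check that the keys you need are actually present in `.env`. The sketch below is not part of the repository: `read_env_file` and `missing_keys` are hypothetical helpers, and it assumes the simple `KEY=value` format shown above, with no variable expansion. It only inspects the file and does not reflect how the repository itself loads the keys.

```python
from pathlib import Path


def read_env_file(path=".env"):
    """Parse simple KEY=value lines; blank lines and # comments are skipped."""
    values = {}
    env_path = Path(path)
    if not env_path.exists():
        return values
    for line in env_path.read_text(encoding="utf-8").splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional quotes around the value.
        values[key.strip()] = value.strip().strip("'\"")
    return values


def missing_keys(required, path=".env"):
    """Return the required keys that are absent or empty in the file."""
    values = read_env_file(path)
    return [key for key in required if not values.get(key)]


if __name__ == "__main__":
    # Adjust this list to the providers you intend to run.
    absent = missing_keys(["GEMINI_API_KEY", "OPENAI_API_KEY"])
    print("missing keys:", absent or "none")
```

Run it from the repository root so that the relative `.env` path resolves to the file you created.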
Modify the constants in main.py to set the problem set, number of problems, etc. Then run the script:
```
python main.py
```
By default, the code runs the Poetiq 3 config described in the blog post. You can uncomment one of the other configs or modify the config in config.py.