Arboreto (Claude Skill)
Claude skill that drives Arboreto for gene-regulatory-network (GRN) inference — fitting GRNBoost2 or GENIE3 regression models that identify transcription-factor → target-gene relationships from bulk or single-cell expression data.
| Type | Claude Skill |
| Supplier | K-Dense Inc. (community OSS) |
| Availability | GA — actively maintained 2025–2026 |
| Pricing | Free / OSS skill (MIT collection); Arboreto itself is BSD-3 |
| Capabilities | Read/Write — Claude executes Arboreto via Python/Bash; Dask handles parallelism |
How to install
- Also packaged in the SciAgent-Skills collection (jaechang-hits (community OSS, CC BY 4.0)): clone
jaechang-hits/SciAgent-Skillsand run/plugin install sciagent-skillsin Claude Code (or copyskills/genomics-bioinformatics/arboreto-grn-inferenceinto~/.claude/skills/). - Claude Code / Claude.ai — Skills CLI (recommended):
npx skills add K-Dense-AI/scientific-agent-skillsInstalls the K-Dense collection; enable the
arboretoskill when prompted (also works in Cursor/Codex via the Agent Skills spec; requires Node ≥ 18). - Claude Code / Claude Desktop — manual clone:
git clone https://github.com/K-Dense-AI/scientific-agent-skills cp -r scientific-agent-skills/skills/arboreto ~/.claude/skills/ pip install arboreto
Project-scoped alternative: copy into .claude/skills/ instead of ~/.claude/skills/.
What it does
SKILL.md with recipes for:
- GRNBoost2 — gradient-boosting GRN inference with self-tuning early stopping (the recommended default)
- GENIE3 — classical random-forest GRN inference for reproducing legacy analyses
- TF-restricted inference using a curated transcription-factor list
- Single-cell GRN inference (compatible with pySCENIC pre-processing)
- Distributed execution via Dask — single machine to multi-node clusters
- Importance-threshold filtering and ranking of TF–target edges
Primary use cases: Transcription-factor regulon discovery, single-cell GRN inference as a step in pySCENIC pipelines, regulator prioritisation for target-discovery and drug-repurposing studies.
Notes
Pairs with the scanpy, pydeseq2, and cellxgene-census skills — Arboreto consumes the (cells × genes) expression matrix produced by upstream QC. The Python script must include the standard if __name__ == "__main__": guard because Dask spawns child processes. Skill is documentation plus Python recipes — Claude calls Arboreto locally via Bash/Python.
Sources
K-Dense-AI/scientific-agent-skillsskills/arboreto/SKILL.md- Arboreto documentation
- Moerman et al. Bioinformatics 2019 (GRNBoost2)
Installed this tool?
Share feedback — install path, OS, errors, workarounds. The form opens with this tool pre-selected and a link back to this page.