image-analysis-router
Health Uyari
- No license — Repository has no license file
- Description — Repository has a description
- Active repo — Last push 0 days ago
- Low visibility — Only 6 GitHub stars
Code Gecti
- Code scan — Scanned 5 files during light audit, no dangerous patterns found
Permissions Gecti
- Permissions — No dangerous permissions requested
Bu listing icin henuz AI raporu yok.
智能选择 17 套分析方案的图像分析分流 Skill:先判断该怎么读图,再输出定向分析和学习建议。
Image Analysis Router
中文文档 | English
🎯 This skill does not treat every image like the same homework. It decides how the image should be read first, then returns a targeted critique and a practical study plan.
✨ Features
| Stage | Function | Description |
|---|---|---|
| 1️⃣ | Smart Routing | Read the request, filenames, and hints first to narrow down the right lens |
| 2️⃣ | Visual Check | Treat the script as a prior, then confirm with actual visual evidence |
| 3️⃣ | Targeted Analysis | Use one of 17 routes instead of forcing one standard onto every image |
| 4️⃣ | Dual Output | Return both an Analysis Report and a Study Report |
🚀 Quick Start
Prerequisites
# 1. Python for the local routing script
py --version
# 2. An AI agent environment that can read local skills
# For example: an environment that can load SKILL.md, references/, scripts/, and image input
# 3. Image input
# Local images, screenshots, filenames, OCR text, or image-heavy requests with context
Usage
🧩 This is a local skill for OpenAI- and Claude-based agent environments. Put the whole folder into your skill directory:
mkdir -p "$CODEX_HOME/skills"
cp -r ./image-analysis-router "$CODEX_HOME/skills/image-analysis-router"
📁 If your agent uses a different skill folder, such as .agents/skills/, replace the target path and keep the same folder structure.
💬 Then call the skill with a real image request, for example:
"Use
image-analysis-routerto review this poster. Focus on hierarchy and typography."
"Break down this batch of interior renders and tell me what to practice next."
"Figure out which route fits this slide first, then give me a study report."
🔎 If you only have text clues, filenames, or OCR, run the routing script first:
py .\scripts\route_image_request.py --prompt "Analyze this tower facade and check how it meets the street" --file "tower-facade-render.jpg"
🧭 How Routing Works
🛣️ The skill answers one question before anything else: what is the right way to read this image?
Image request
↓
[1️⃣ Text and filename pass] ──→ route_image_request.py generates a routing prior
↓
[2️⃣ Visual confirmation] ──→ confirm the main route or split the batch
↓
[3️⃣ Method selection] ──→ choose 1 of 17 specialized routes
↓
[4️⃣ Final output] ──→ Analysis Report + Study Report
Route Confidence
🧠 The skill marks how sure it is about the route before moving on:
| Level | Meaning |
|---|---|
high |
The image type and user goal point clearly to one route |
medium |
One route leads, but another one still makes sense |
low |
The batch is mixed, the clues are thin, or the images need regrouping first |
🗂️ Route Coverage
🖍️ The skill currently ships with 17 routes for common image-reading jobs.
Design and communication
graphic-design: posters, brand visuals, packaging, UI screenshots, ad creativesinfographic-diagram: charts, maps, process diagrams, information graphicstypography-lettering: type posters, lettering, logotypes, calligraphy, letterform studypresentation-document: slides, report pages, proposal pages, document spreads
Image and narrative
photography: documentary, portrait, street, editorial, commercial photographyfilm-frame: movie stills, animation frames, storyboard shots, cinematic compositionscomics-sequential: comic pages, manga, strips, webtoon panels, sequence storytellinggame-visual-design: game UI, HUD, level screenshots, character panels
Art and space
painting-illustration: paintings, illustrations, concept art, stylized image workinterior-design: interior renders, room photos, material and furniture studiesarchitecture-urban: facades, street views, public space, urban and site relationshipssculpture-installation-craft: sculpture, installation, ceramics, craft-based 3D work
Objects and specialist imagery
product-industrial-design: products, prototypes, object form, packaging structurefashion-styling: outfits, silhouettes, accessories, lookbooks, styling visualsscientific-medical-imaging: medical scans, microscopy, technical and research imagery
Fallback routes
generic-mixed: mixed batches that need grouping before critiqueuniversal-fallback: images that do not fit cleanly anywhere else but still need a structured read
🧠 What You Get
📌 Every run starts with a route decision: which route won, how confident the skill is, and why that route fits better than the alternatives.
📝 The Analysis Report gives a quick read, a route-specific breakdown, and a final judgment about what the image is trying to do and where it actually lands.
📚 The Study Report turns that into next steps: what to learn now, what drills to run, what to watch next time, and what comparisons are worth making.
📁 Project Structure
image-analysis-router/
├── SKILL.md
├── README.md
├── README.zh-CN.md
├── agents/
│ └── openai.yaml
├── scripts/
│ └── route_image_request.py
└── references/
├── route-matrix.md
├── output-contract.md
└── method-*.md
⚙️ Configuration
🛠️ The skill does not require a separate config file for the default workflow.
🔤 If you only want a local routing guess before the full image review, you can call the script directly:
py .\scripts\route_image_request.py --prompt "<user goal>" --file "<file name or path>" --hint "<OCR or extra clue>"
📎 Parameters:
--prompt: the user's goal or question--file: image file name or path, repeatable--hint: OCR text, title, note, or any extra clue, repeatable
📄 Output
📦 A normal run gives you two core outputs:
| Output | Content |
|---|---|
Analysis Report |
route decision, key findings, detailed critique, overall judgment |
Study Report |
learning focus, drills, likely mistakes, next-step practice |
🧪 For batches, the skill can also comment on set-level consistency, before/after changes, and whether the images should be split into subgroups first.
⚠️ Ground Rules
🧱 The script output is a starting point, not the final answer.
👀 Important judgments have to go back to visible evidence.
📐 Different image types should not be judged with the same standard.
🤝 When the route is unclear, the skill should say so instead of bluffing.
✅ Supported Environments
💻 This skill works best in AI agent environments that can load local skill folders and accept image input.
| Environment | Status |
|---|---|
Agents that can read SKILL.md and local reference files |
✅ Supported |
| Agents that can inspect screenshots, local images, or image attachments | ✅ Supported |
| Text-only chat environments with no access to local skill files | ⚠️ Limited |
🔗 Related Files
📚 Key files in this repo:
Yorumlar (0)
Yorum birakmak icin giris yap.
Yorum birakSonuc bulunamadi