cybercode-cli

agent
Guvenlik Denetimi
Basarisiz
Health Uyari
  • License — License: NOASSERTION
  • No description — Repository has no description
  • Active repo — Last push 0 days ago
  • Low visibility — Only 5 GitHub stars
Code Basarisiz
  • spawnSync — Synchronous process spawning in bin/cli.mjs
  • process.env — Environment variable access in bin/cli.mjs
  • network request — Outbound network request in bin/cli.mjs
  • spawnSync — Synchronous process spawning in package/bin/cli.mjs
  • process.env — Environment variable access in package/bin/cli.mjs
  • network request — Outbound network request in package/bin/cli.mjs
  • child_process — Shell command execution capability in scripts/postinstall.cjs
  • execSync — Synchronous shell command execution in scripts/postinstall.cjs
  • process.env — Environment variable access in scripts/postinstall.cjs
  • fs module — File system access in scripts/postinstall.cjs
Permissions Gecti
  • Permissions — No dangerous permissions requested

Bu listing icin henuz AI raporu yok.

README.md

CyberCode

The All-in-One AI Agent Platform — Free GPT-5.5, Claude Opus 4.8 & GLM-5.2

License: MIT
Python 3.8+
Self-Contained
Streaming

A self-contained, physical-level AI Agent — directly drives frontier LLMs with 9 built-in system tools that read/write files, execute code, scan the web, and generate images & videos.

中文文档

Highly recommended: Linux.do — A genuine community for developers and tech enthusiasts. You'll find high-quality AI tool sharing, open-source project discussions, and cutting-edge tech news. CyberCode was born thanks to the inspiration and support of the Linux.do community. We sincerely invite all developers to join this pure, friendly, and deep technical community.


What is CyberCode

CyberCode is an AI Agent that runs in your local browser. It's not a chatbot shell — it's a doer with hands and feet that can actually operate your file system, run scripts, access the network, and generate multimedia content.

The key point: use GPT-5.5, Claude Opus 4.8, GLM-5.2, Gemini 3.1 Pro, DeepSeek V4 and 32+ frontier models for free, plus free gpt-image-2 image generation and Nanobanana video generation. All models are accessible through a unified gateway with a clean, user-friendly experience.

CyberCode Real UI Screenshot

CyberCode Web UI — Codex-dark theme, model selector on the left, skills panel on the right


Core Capabilities

Capability Details
Free Frontier Models GPT-5.5 / Claude Opus 4.8 / GLM-5.2 / Gemini 3.1 Pro / DeepSeek V4 Flash and 32+ models
Free Image Generation gpt-image-2 (1024x1024 / portrait / landscape), text-to-image and image editing
Free Video Generation Nanobanana model + HyperFrames HTML rendering engine, with edge-tts narration
9 Atomic Tools Code execution / File read-write / Web scraping / JS execution / Image gen / Vision / User interaction / Memory
Function Calling OpenAI tools-compatible function calling, streaming and non-streaming
3-Level Memory L0 meta-rules / L1 insight index / L2 stable facts / L3 task SOPs
Self-Contained Only Python stdlib + requests needed, no LangChain / Playwright / browser binary deps

Why CyberCode

Most AI clients on the market are either pure chat boxes (no execution capability) or heavy frameworks (depending on a pile of Node modules and browser kernels). CyberCode takes a different path:

Keep the Agent thin, make the models thick. The core is only about 1,300 lines of Python, but it connects to 30+ frontier models through a unified OpenAI-compatible interface. You write a requirement, it decides which tools to call, which files to read, which code to run — the whole process streams in real time, you watch it work.

Architecture Diagram

Architecture: Agent Core orchestrates, LLM Client streams, 9 tools each serve their purpose


32+ Built-in Frontier Models

All models are proxied through a unified gateway — switching models takes a single API call. Partial model list:

Conversation & Reasoning Models

Model Family Notes
gpt-5.5 OpenAI GPT Flagship conversational model
gpt-5.4 OpenAI GPT High cost-efficiency
gpt-5-mini OpenAI GPT Lightweight & fast
gpt-5.3-codex OpenAI Codex Code-specialized
gpt-4.1 OpenAI GPT Classic & stable
claude-opus-4-8 Anthropic Claude Top-tier reasoning
claude-opus-4-7 Anthropic Claude Long-context analysis
gemini-3.1-pro-preview Google Gemini Multimodal
gemini-3.5-flash Google Gemini Ultra-fast response
deepseek-v4-flash DeepSeek Top-tier domestic
deepseek-v4-pro DeepSeek Deep reasoning
deepseek-r1-14b DeepSeek R1 Reasoning chain
glm-5.2 Zhipu GLM Free tier, FC support
free/glm-5.2 Zhipu GLM Default model, zero cost
kimi-k2.7 Moonshot Kimi Ultra-long context
minimax-m3 MiniMax Strong general baseline
qwen-2.5-coder-14b Alibaba Qwen Code generation
llama-3.1-8b Meta Llama Open-source standard
mistral-small-24b Mistral European flagship

Multimedia Generation Models

Model Use Case Method
gpt-image-2 Text-to-image / Image-to-image Async creation-tasks API
codex-gpt-image-2 Image editing image-edits endpoint
nanobanana Video generation HyperFrames + ffmpeg pipeline
hy-mt1 Multimodal understanding Visual Q&A
Media Generation

Image generation & video creation — AI image gen on the left, HyperFrames timeline on the right


9 Atomic Tools

The Agent works not by "chatting" but by calling tools to complete tasks. CyberCode has 9 physical-level tools covering all aspects of system operation:

9 Atomic Tools
Tool Capability Example
tool_code_run Execute Python / Bash / Shell scripts Run data analysis, install deps, call system commands
tool_file_read Read any text file, with line numbers and ranges Check logs, read source, view config
tool_file_write Overwrite / append / prepend file content Generate code, write reports, edit config
tool_file_patch Precise search-and-replace file fragments Fix bugs, refactor functions
tool_web_scan Scrape web pages and extract body text Read docs, crawl data, look up info
tool_web_execute_js Run JavaScript in the browser Automate operations, extract dynamic content
tool_generate_image Generate images via gpt-image-2 Illustrations, UI mockups, artwork
tool_view_image Visually understand image content Read screenshots to find bugs, describe scenes
tool_ask_user Ask the user questions and wait for answers Clarify requirements, confirm risky operations

Additionally, two memory tools: update_working_checkpoint (short-term notes) and start_long_term_update (long-term experience), keeping the Agent on track during long tasks and smarter on repeated ones.


HyperFrames Video Engine

CyberCode includes the HyperFrames skill set — a framework for rendering video with HTML. You describe what video you want, the Agent automatically writes HTML compositions (with data-* timing attributes), then uses GSAP / Lottie / Three.js for animation, and finally ffmpeg composites an MP4 with audio.

User: "Make a 10-second cat science video with narration"
  ↓
Agent decision path:
  1. Call gpt-image-2 to generate 3 scene images
  2. Call edge-tts to synthesize narration MP3
  3. Write HyperFrames HTML composition (GSAP timeline + fade in/out)
  4. ffmpeg composites H.264 1920x1080 + AAC audio
  5. Self-check: duration, resolution, audio presence → 100/100
  ↓
Output: cat_video_final.mp4

HyperFrames includes 7 domain skills, loaded on demand:

Skill Purpose
hyperframes-core HTML composition author contract (data-* attrs, clips, tracks)
hyperframes-animation Atomic animations (GSAP / Lottie / Three.js / CSS / WAAPI)
hyperframes-creative Creative direction (color, typography, narration, beat)
hyperframes-media TTS voiceover, background music, subtitles, bg removal
hyperframes-cli Dev loop (init / lint / render / publish)
hyperframes-registry Registry component installation
general-video General video workflow routing

Quick Start

Option 1: npm One-Click Install (Recommended)

npm install -g cybercode-cli

After installation, type in your terminal:

cybercode web

The terminal will display the local service address — open it in your browser to see the CyberCode interface. No configuration needed — model keys, gateway addresses, and default parameters are all auto-configured out of the box.

Option 2: Clone the Repository

git clone https://github.com/ciouskeila-hue/cybercode-cli.git
cd cybercode-cli
python python/cybercodewebui.py

Also requires no manual configuration — after launch, visit http://localhost:18600 in your browser.

Getting Started

After opening the browser, you'll see a login screen. You have two options:

  • Log in to your CyberCode account: all models (GPT-5.5, Claude Opus 4.8, GLM-5.2, and 32+ frontier models) are immediately available, along with image and video generation. GitHub / LinuxDo one-click login supported.
  • Skip login: click the "Skip login" link at the bottom to use your own API key. No platform models will be scanned — you'll need to manually add models in Settings (API base URL, model name, API key).

After logging in, free/glm-5.2 (zero-cost model) is selected by default. You can switch to any other model in the left-side model selector at any time. Just type your request in the input box and the Agent will automatically call tools to complete the task.

Zero-config philosophy: CyberCode handles all underlying configuration at startup — model routing, key management, and token generation all happen in the background. As a user, you just log in (or skip login and configure your own models), and let the system handle the rest.


API Reference

CyberCode exposes a clean HTTP API for integration with other systems:

Method Endpoint Description
GET / Web UI page
GET /api/status Running status + current model
GET /api/sessions Conversation history list
GET /api/skills Skill document list
GET /api/messages?path= Replay a session
GET /api/videos Rendered MP4 list
GET /api/video/<relpath> Video stream (Range support)
POST /api/chat Send message, SSE streaming response
POST /api/chat (video:true) Video mode, injects HyperFrames preamble
POST /api/llm Switch LLM
POST /api/stop Abort current task
POST /api/new Start new session
POST /api/continue Restore historical session

Tech Stack & Acknowledgments

CyberCode's Agent core architecture (agent loop structure, 9-tool design, memory hierarchy philosophy, system prompt approach) is derived from the GenericAgent project, open-sourced by lsdefine under the MIT License:

Built on top of this, CyberCode adds:

  • Rewritten LLM Client with OpenAI-compatible streaming + function calling
  • Unified gateway proxy layer (model routing + session tokens)
  • HyperFrames video engine integration (HTML → ffmpeg pipeline)
  • edge-tts voice synthesis (auto-install logic built into system prompt)
  • Codex-dark themed Web UI (i18n Chinese/English)
  • Zero-config auto-deployment (cybercode web after npm install)

Dependencies

Dependency Purpose Required?
requests HTTP requests Yes
ffmpeg Video compositing Video mode only
edge-tts Voice synthesis Video narration only
Python stdlib Everything else Built-in

Project Structure

cybercode/
├── agent_core.py            # Agent core: LLM Client + 9 tools + agent loop
├── cybercodewebui.py        # Web server: HTTP API + SSE streaming + proxy layer
├── cybercodewebui.html      # Frontend: Codex-dark UI + i18n + real-time chat
├── mykey.json               # Model config (auto-generated, not in repo)
├── custom_system_prompt.txt # Custom system prompt (hot-reload)
├── .auth_token              # Auto-generated token (not in repo)
├── skills/                  # 14 skill documents
│   ├── hyperframes.md       # Video engine entry
│   ├── hyperframes-core.md  # HTML composition contract
│   ├── hyperframes-animation.md
│   ├── hyperframes-creative.md
│   ├── hyperframes-media.md
│   ├── hyperframes-cli.md
│   ├── hyperframes-registry.md
│   ├── image-gen.md         # Image generation API
│   ├── edge-tts-tts.md      # Voice synthesis
│   ├── general-video.md     # General video routing
│   ├── motion-graphics.md
│   ├── product-launch-video.md
│   ├── website-to-video.md
│   └── faceless-explainer.md
├── memory/                  # 3-level memory system
│   ├── global_mem.txt       # L2 stable facts
│   └── global_mem_insight.txt # L1 insight index
├── temp/                    # Working directory (gitignored)
├── docs/
│   └── images/              # README demo images
└── .gitignore

Usage Examples

Example 1: Write and Execute a Script

User: Write a Python script that counts lines in all .py files in the current directory, sorted by line count

Agent:
  → tool_file_write: write count_lines.py
  → tool_code_run: python count_lines.py
  → Returns: agent_core.py 1452 lines, cybercodewebui.py 1180 lines...
  → tool_file_patch: found bug, fixed sorting logic
  → tool_code_run: re-run, correct results
  → Summary: 3 Python files, 2632 total lines

Example 2: Generate an Image and Understand It

User: Generate a cyberpunk-style cat image, then tell me what's in it

Agent:
  → tool_generate_image: prompt="cyberpunk cat, neon lights, digital art"
  → Wait for async task, image saved to temp/
  → tool_view_image: analyze the generated image
  → Returns: An orange cat wearing neon goggles, background is purple and cyan city lights...

Example 3: Scrape a Web Page

User: Check what's new on python.org homepage

Agent:
  → tool_web_scan: url="https://python.org", text_only=true
  → Extract body text, filter navigation and footer
  → Returns: Python 3.13 released, PEP 7xx new proposal, PyCon 2026 dates announced...

Example 4: Generate a Video with Narration

User: Make a 10-second cat science video with Chinese narration

Agent (video mode, auto-injects HyperFrames preamble):
  → Generate 3 scene images (gpt-image-2)
  → Synthesize narration MP3 (edge-tts)
  → Write HTML composition (GSAP timeline + fade in/out + data-start audio sync)
  → ffmpeg composites H.264 1920x1080 + AAC
  → Self-check: 10s duration ✓ 1080p ✓ audio present ✓
  → Output: cat_video_final.mp4

Configuration

CyberCode uses a zero-config design — all configuration is automatic at startup. The following environment variables are available for advanced users:

Variable Default Description
CYBERCODE_PORT 18600 Web service port
CYBERCODE_HOST 127.0.0.1 Listen address

Auto-Configuration

CyberCode automatically handles the following at first launch — no user intervention needed:

  • Connects to the model gateway and fetches the available model list
  • Generates a session token (.auth_token)
  • Selects the default model (free/glm-5.2, zero cost)
  • Initializes the memory system and working directory

Users only need to log in to their CyberCode account (or skip login and configure their own models) — the system handles everything else.


FAQ

Q: Is it really free?

Yes. CyberCode defaults to the free/glm-5.2 model, which is completely free. After logging in to your CyberCode account, frontier models like GPT-5.5 and Claude Opus 4.8 are also available — specific quotas depend on platform policy. You can also skip login and use your own API key.

Q: Do I need a VPN?

No. The model gateway is directly accessible — works out of the box.

Q: Does it support function calling?

Yes. free/glm-5.2, deepseek-v4-flash, glm-5.2 and other models support OpenAI tools-compatible function calling, both streaming and non-streaming. Some models like gpt-5.4 don't support FC — in that case, the Agent falls back to XML tool-call parsing.

Q: What dependencies does video generation need?

ffmpeg (available in system PATH) and edge-tts (pip install edge-tts). The system prompt has built-in edge-tts auto-install logic — the Agent will detect and install it automatically on first use of video mode.

Q: Is my data uploaded?

No. CyberCode runs locally — all file operations, code execution, and memory storage stay on your local disk. Only LLM inference requests are sent to the model gateway.


Development

Local Debugging

# Clone and run directly
python python/cybercodewebui.py --port 18600 --host 0.0.0.0

# Start with a specific LLM index
python python/cybercodewebui.py --llm_no 4

Custom System Prompt

Edit custom_system_prompt.txt — content is hot-reloaded into the system prompt of every conversation. Ideal for injecting project-specific constraints or domain knowledge.

Custom Skills

Create .md files in the skills/ directory — the Agent will list them in /api/skills and load them on demand.


Disclaimer

Please read the following terms carefully. By using this software, you acknowledge that you have read and agree to all of the terms below.

  • For learning and entertainment only: This tool is intended solely for learning, research, and technical exploration. It must not be used for any commercial purpose or any illegal activity.
  • Do not upload personal data: Never enter any personal or sensitive data (including but not limited to ID numbers, phone numbers, bank card numbers, real names, addresses, etc.) into the system. Your inputs are sent to third-party LLM services via the model gateway for inference.
  • Do not redistribute generated content; delete immediately: Content generated by the system may be inaccurate, incomplete, or inappropriate. Do not forward, publish, or disseminate generated content to others, and delete it immediately after use. You assume full responsibility for the use of generated content and any consequences thereof.
  • No warranty: This software is provided "as is", without any express or implied warranty. The author shall not be liable for any direct or indirect damages arising from the use of this software.

License

MIT License — see LICENSE for details.

CyberCode's Agent core architecture is derived from GenericAgent (© 2025 lsdefine, MIT). Acknowledgments.


CyberCode — Let frontier LLMs actually get things done.

Yorumlar (0)

Sonuc bulunamadi