Qwen Code Weekly: Parallel Agent Panel, Auto-Memory On by Default, Worktree Phase D

Qwen Team

2026-05-28

The coding agent space heated up another notch this week. Cognition announced over $1B in funding at a$ 26B valuation, and Simon Willison declared that Anthropic and OpenAI have found PMF through coding agents —at the cost of significantly higher enterprise spending.

Qwen Code shipped v0.16.2 this week with 30+ PRs merged, covering three directions users feel most directly:

Parallel agents are finally visible. Previously when multiple agents ran concurrently, all you could do was watch logs scroll. Now there’s a compact panel—one line per sub-agent, keyboard navigation, real-time progress. This is the first truly usable interaction interface for “agents evolving from single-threaded assistants to multi-threaded teams.”

Memory requires zero configuration. auto-dream and auto-skill are now on by default. The agent learns your preferences and project conventions as you use it, persisting across sessions. New /memory command to view and toggle anytime. Compared to Claude Code still requiring manual CLAUDE.md maintenance, this is a significant UX gap.

Isolated environment in one parameter. Worktree advanced from last week’s Phase C (session persistence) to Phase D—qwen-code --worktree launches directly into an independent work tree, --worktree=#<N> fetches remote PR code. Optional worktree.symlinkDirectories config symlinks heavy directories like node_modules to the main repo, avoiding redundant installs. Combined with /goal + Auto Approval, the agent runs a full task autonomously in isolation while keeping main branch clean.

Additionally, long conversation overflow saw real progress this week: auto-compression was redesigned from a single threshold to a warn / auto / hard three-tier ladder, reducing wasted headroom for large-window models from 30% to a fixed 33K. Headless mode gained --max-wall-time and --max-tool-calls runtime budgets—finally a hard limit for running agents in CI.

✨ New Features

Parallel Agent Panel: All Agents Visible at a Glance

If you’ve been following coding agent trends, “parallel” is the most mentioned keyword. Cognition’s $26B valuation sells the story of multiple agents working simultaneously. But when multiple agents are running, how do you know who’s doing what and where they are?

Qwen Code’s approach: when 2 or more agents execute in parallel, an inline compact panel (InlineParallelAgentsDisplay) renders automatically. Each sub-agent gets one line showing status, name, current operation, elapsed time, and token consumption. Keyboard ↑↓ switches focus, Enter expands details for full output, Esc returns to input box, and typing automatically cancels focus back to input—fully mouse-free.

What you can do with it:

When /review and similar commands launch multiple sub-agents, see all progress at a glance (status/name/operation/time/tokens)
Keyboard navigation: ↑↓ to select, Enter to expand details, Esc to return to input—typing auto-cancels focus
Panel updates in real-time during live phase, then freezes as scrollable history on completion

See PR #4477

Auto-Memory On by Default + /memory Toggle

Claude Code is pushing hard on auto-memory—users have to manually create CLAUDE.md and write rules for the agent to remember. The problem is many people don’t even know this file exists, let alone how to maintain it.

Qwen Code turned auto-dream and auto-skill on by default this week. This means: things the agent learns during conversation (your preferences, project conventions, common patterns) are automatically written to memory and loaded in the next session. No configuration, no file creation, works out of the box.

If you want manual control, the /memory dialog now includes an Auto-skill toggle row alongside existing Auto-memory and Auto-dream—three toggles navigated with arrow keys, toggled with Enter, settings persisted to workspace’s .qwen/settings.json.

What you can do with it:

Zero configuration needed—all three managed-memory subsystems (summary memory, staged consolidation, skill extraction) are on by default
Preferences and project conventions persist across sessions without repeating yourself
/memory dialog provides three toggle rows to enable/disable any subsystem anytime
Existing explicit settings are unaffected—new defaults don’t override your previous choices

See PR #4547

Worktree Phase D: Launch Directly Into Isolation

Last week Worktree reached Phase C—session persistence + status bar display. This week jumps straight to Phase D: add --worktree at launch and Qwen Code automatically creates and enters an independent work tree. With PR refs, you can pull any PR’s code into an isolated environment for review or continued development without touching main branch.

Another practical config: worktree.symlinkDirectories settings support symlinking heavy directories like node_modules and .venv in the worktree back to the main repo, avoiding redundant dependency installs. This is opt-in—configure path list in .qwen/settings.json, and all subsequent worktrees (--worktree, enter_worktree tool, Agent isolation) apply automatically. Path traversal, absolute paths, .git and .qwen directories are safely rejected.

What you can do with it:

qwen --worktree enters isolation with one parameter—no manual git worktree add
--worktree=#<N> or GitHub PR URL: fetches remote PR via git fetch origin pull/<N>/head into a worktree for review or continued development
Configure worktree.symlinkDirectories to share large directories like node_modules via symlinks without reinstalling
Forms a complete loop with last week’s session persistence + /resume

See PR #4381

Headless Protection: Runtime Budgets for Non-Interactive Mode

Anthropic published a Zero Trust AI Agent Security Framework this week, noting that a major risk of autonomous agents is “running away with no one to stop them.” Baoyu (@dotey ) is saying the same thing: the key to using coding agents well lies at both ends—planning at the start and review at the end.

Qwen Code added two runtime budget parameters: --max-wall-time (maximum runtime, supporting formats like 90, 30s, 5m, 1h) and --max-tool-calls (maximum tool call count). When budget is exhausted, it terminates with exit code 55, distinct from user cancellation (130) and turn limits (53)—CI scripts can pinpoint the termination reason.

Additionally, when using --yolo (full auto-approval) without a sandbox in headless mode, a security warning is printed to stderr at startup, suppressible with QWEN_CODE_SUPPRESS_YOLO_WARNING=1.

What you can do with it:

qwen -p "refactor module" --max-wall-time 5m --max-tool-calls 50—hard limits for agents in CI
Auto-stops on budget exhaustion, exit code 55, CI scripts can distinguish “budget exhausted” from “completed normally”
All budgets default to -1 (unlimited), existing usage unaffected
--yolo without sandbox startup warning helps discover unsafe configurations

See PR #4502

Three-Tier Auto-Compression: No More Long Conversation Overflow

Simon Willison says coding agents are significantly increasing enterprise costs. A core reason: long conversation context keeps growing, and when it approaches the model’s context window it either crashes or loses critical information.

Qwen Code redesigned auto-compression from a single threshold (triggered at 70% window) to a three-tier ladder, using both occupancy ratio and absolute byte count:

warn tier (≈60% window or 20K from auto tier): alerts user that context is approaching limits—a UX-level early warning
auto tier (≈70% window or 13K from window edge): proactively triggers compression, low cost, nearly lossless conversation quality
hard tier (3K from window edge): forced compression—the last safety net before API rejects an oversized prompt

Small-window models (32K/64K) use the ratio branch, large-window models use the absolute byte branch—wasted headroom drops from 30% of window to approximately 33K fixed overhead.

What you can do with it:

Long conversations no longer crash from context overflow—hard tier force-compresses before API rejection
No manual configuration needed, system auto-calculates three-tier thresholds based on model window size
Old chatCompression.contextPercentageThreshold config is deprecated, outputs deprecation warning at startup

See PR #4345

Command Substitution No Longer Rejected Outright: Confirmation + ⚠️ Warning

Anthropic’s Zero Trust Security Framework notes that agent security boundaries need to balance “usable” and “secure.” Qwen Code previously went too strict: if agent-generated shell commands contained $(), backticks, <() or other command substitution syntax, execution was flatly refused—even YOLO mode couldn’t bypass it.

Now it shows a confirmation dialog with a ⚠️ “Contains command substitution” warning. You see the warning and choose to allow or deny—the decision is yours, no more blanket rejections. YOLO mode executes normally with an audit record in DEBUG logs for post-hoc review.

What you can do with it:

Legitimate scenarios like $(git rev-parse HEAD) to get commit hash are no longer blocked
Confirmation dialog shows ⚠️ warning so you immediately know the command contains substitution syntax
YOLO mode executes normally, DEBUG logs retain audit records
Monitor tool fixes the same overly-strict rejection issue

See PR #4386

More New Features

Feature	PR	Impact
Token Plan Cache Control: prompt cache saves repeated billing	#4495	Token Plan users get auto-cached system prompts in consecutive conversations, saves more with three-tier compression
memory-leak-debug skill: built-in memory leak diagnosis	#4468	Agent has systematic heap snapshot analysis workflow for memory leaks
Directory completion without trailing space	#4288	Tab-completing directories lets you continue typing sub-paths without deleting spaces
MCP server removals persisted	#4535	Removed MCP servers stay removed after restart
@ trigger requires preceding space	#4487	Email addresses and other @-containing text no longer falsely trigger file completion
Paste multiple file paths auto-adds @	#4544	Pasting multiple paths from terminal automatically converts to @ references
Project-local context .qwen/QWEN.local.md	#4394	Create `.qwen/QWEN.local.md` in project root for personal local rules, agent auto-loads
new app prompt migrated to skills	#4567	New app guidance prompts are more flexible and customizable
ACP Streamable HTTP transport	#4472	Daemon adds standard ACP protocol endpoint for third-party UI integration
Daemon followup_suggestion push	#4507	Web UI auto-pushes next-step suggestions after session ends
Background agent concurrency limit	#4324	Background agents won’t exhaust resources by running too many simultaneously
W3C traceparent propagation	#4390	Enterprises can trace complete call chains for agent requests
Daemon CORS allowlist	#4527	`--allow-origin` precisely controls which frontends can access Daemon
Daemon in-session model switching	#4546	Switch models mid-conversation without exiting
Daemon file logging	#4559	Daemon runtime logs written to file for easier debugging
Serve prompt deadline + SSE idle timeout	#4530	Long-unresponsive requests auto-timeout instead of blocking resources
Serve POST /session/:id/recap	#4504	Get session summaries for Web UI historical conversation overviews

🔧 Notable Fixes

PR	Fix	Impact
#4366	Fix long session AbortSignal listener leak (MaxListenersExceededWarning)	Long conversations no longer emit listener overflow warnings
#4489	auto-skill no longer overwrites existing skill files	Manually written skills won’t be overwritten by auto-created ones
#4407	safeJsonStringify preserves repeated object references	Complex data structures no longer lose information during serialization
#4426	Redact credentials from source diagnostic info	Error messages no longer leak keys/tokens
#4371	Strip more dangerous interpreter rules	Agent permission boundaries are stricter
#4478	Token Plan model defaults aligned with ModelStudio	Token Plan users see model lists consistent with the platform
#4512	/context token stats aligned with actual API requests	`/context` finally shows accurate token counts
#4465	WeChat integration allows Windows workspace image paths	Windows users no longer get path errors when sending images via WeChat
#4464	WeChat integration sends decryptable image payloads	Images sent via WeChat integration are now viewable by recipients
#4517	Refresh raw model derived defaults	Parameter configs update correctly after switching models
#4461	Show startup warnings in stderr before TUI renders	Startup issues no longer swallowed by TUI interface
#4470	Fix stale closure race in text buffer commit handler	Fast input submissions no longer occasionally lose content
#4510	Daemon cross-client sync follow-ups (epoch-reset, approval-mode serialization)	Multi-client connections to same Daemon have more stable state sync
#4497	Claude marketplace correctly populates resources when pointing to folder	Resource loading no longer empty when using Claude marketplace plugins
#4491	SDK canUseTool timeout honored in CLI control requests	Tool call timeouts no longer ignored in SDK integrations
#4453	Clean stale outputs before build to prevent TS5055	Developers building from source no longer get TypeScript incremental build errors

🙏 Contributors

Thanks to all contributors this week:

Contributor	Key Contributions
@LaZzyMan	Worktree Phase D, three-tier auto-compression, auto-memory default-on, auto-skill overwrite fix, command substitution permission fix
@wenshao	Parallel Agent panel, @ trigger fix
@DragonnZhang	Token Plan cache control, SDK fix, monitor docs
@doudouOUC	W3C traceparent propagation, Serve deadline/recap, AbortSignal leak fix, telemetry
@chiga0	Daemon in-session model switching, cross-client sync fix
@huww98	memory-leak-debug skill, stale closure fix
@yiliang114	WeChat integration fixes, OSS sync CI
@BZ-D	Headless protection
@dykebo	Directory completion optimization
@kkhomej33-netizen	Background agent concurrency limit
@pomelo-nwu	/context alignment fix, PR template refactor
@qqqys	Credential redaction, interpreter rule security hardening
@Jerry2003826	MCP removals persistence, model defaults fix
@MikeWang0316tw	Multi-file paste auto-adds @
@DennisYu07	new app prompt migration
@YingchaoX	Token Plan defaults alignment
@ihubanov	safeJsonStringify fix
@kagura-agent	Startup warning display (first contribution 🎉)

How to upgrade: Run npm i @qwen-code/qwen-code@latest -g to get the latest version.

For questions or suggestions, feel free to open an issue on GitHub Issues !