Research Essay · Neural Computers

Author: Mingchen Zhuge. Published April 7, 2026. Updated April 7, 2026.

Neural Computer: A New Machine Form Is Emerging

TL;DR: It is a bet that the machine itself will learn to run.

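If you have ever thought that "AI will eventually become a kind of computer," this essay is written for you.

Over the past few decades, computers have become a primary medium through which people get tasks done. In the last few years, AI has begun to occupy that position as well: it no longer only answers questions, but also calls tools, operates interfaces, and takes part in real workflows. The question changes accordingly: do we expect AI to use computers, or to become a kind of computer?

Neural Computer (NC) is about exactly this question. What it cares about is whether a model can take over part of the responsibilities that used to belong to the running of the machine itself.

A clarification first: the NC discussed here is not exactly the NTM / DNC line of work by Alex Graves [1][2], nor is it about some new kind of hardware. What this essay wants to discuss is whether a learning machine will move from "using a computer" to "being a computer."

Its value does not lie in being a stronger agent, a world model of computer environments, or a layer of intelligence bolted onto a conventional computer. Its concern is to let the system responsibilities that are usually outsourced to the program stack, the toolchain, and the control layer gradually move into the Runtime the model actually depends on. I believe this idea has crossed many people's minds; for now I call it a "pre-consensus."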

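Conclusions first

Neural Computer (NC) asks whether models will gradually take over part of the machine's own running responsibilities.
Conventional computers are organized around explicit programs, agents around tasks, world models around environments, and NCs around the Runtime.
Completely Neural Computer (CNC) is the complete form of an NC.
Current prototypes have begun to demonstrate some early Runtime primitives.
If capabilities begin to enter the Runtime and are installed, reused, and governed there, the Neural Computer will redefine what a "computer" is.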

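1. Why now: "a new machine form" is emerging

Three things are happening at the same time.

First, agents are getting better and better at getting things done. From MetaGPT in 2023 (one of the "early" coding agents) [3], which could barely write a few hundred lines of code, to 2025, when Cursor, Codex, and Claude Code became standard productivity tools for programmers, to today, when OpenClaw [4] has truly entered the public eye, what people care about is no longer whether an agent occasionally gets something done, but whether it can enter real production and daily life and reliably handle all kinds of affairs for you.

For agents, the bottlenecks currently agreed upon are: (1) how to stay stable on long-horizon tasks, (2) how to accumulate capabilities, and (3) how to keep reusing workflows. The current solutions mostly add things on the agent's scaffold (or harness) side: stronger memory, longer workflows, and more robust action loops, all to get the job done as far as possible. Pushed further, a more radical path is recursive self-improvement: models train the next generation of models, and agents keep rewriting themselves [5].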

Figures (MetaGPT paper front page; OpenClaw repository popularity chart): Agents are moving from prototype experiments, to professional productivity tools, to everyday infrastructure for the general public. [3][4]

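Second, world models are getting better and better at modeling dynamic environments. Simulating how an environment evolves has always been their job; what deserves more attention now is that this capability has entered some real closed loops. Especially for corner cases that are hard to collect cheaply and repeatedly in reality, rollouts are being used directly for prediction, planning, control, and training. Along this line, from Making the World Differentiable, proposed by Jürgen Schmidhuber in 1990 [6], to World Models in 2018 [7], to Waymo now using a world model for autonomous driving simulation and training [8][9], this route is turning into a system capability.

What world models are good at is laying out the future in advance. What they supply is the system's ability to reason forward: if a certain action is taken, where will the environment go? Even without acting directly for now, the system can generate several futures first, experiment ahead of time, and expose risks. Today this line has split into several clear directions. In autonomous driving and physical AI, it mainly plays the role of a simulation and synthetic data engine, filling in real-world data that is expensive, dangerous, or scarce, for example Waymo World Model and NVIDIA Cosmos [8][10]. In spatial intelligence, it pursues 3D worlds that can be generated, entered, and interacted with continuously, for example Marble from World Labs [11]. In the direction of real-time interactive worlds, generative models have moved from static content generation to controllable, interactive, explorable environment generation; representative examples include GameNGen's real-time neural simulation of DOOM [12] and Genie 2 / Genie 3 from Google DeepMind [13][14]. Although these directions have diverged, underneath they touch the same kind of problem: how to learn, inside the system, the regularities by which an environment evolves with time, actions, and constraints.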

Figures (diagram from Jürgen Schmidhuber's 1990 Making the World Differentiable paper; World Models paper front page): From 1990 to 2018 to now, world models have moved from an early vision of differentiable world modeling toward autonomous driving simulation and training, as represented by Waymo World Model. [6][7][8][9]

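Third, the structural friction of conventional computers in the AI era is becoming increasingly obvious. A large share of today's tasks are open-ended, long-horizon, and continuously interactive, and this is exactly where the conventional software stack starts to feel heavy. The conventional stack certainly has the advantage of stability, but in many scenarios dominated by natural language, demonstrations, interface operations, and weak constraints, the cost of organizing and driving such tasks keeps rising.

Conventional computers are themselves rebuilding their foundations for AI. Chips, compilers, memory systems, and software stacks are all becoming more model-friendly. But most of these changes still happen inside the existing computing paradigm: they make the old machine better suited to AI without rewriting "what a machine is." Among these changes, routes like Taalas push things a step further and begin to turn a specific model into a unit of deployment: the model is no longer just a workload running on a machine, but approaches the line of "organizing hardware around the model" [15]. At least today, though, this is still a change at the deployment layer and not yet a general machine form.

Put these three things together and the question becomes more concrete: if agents are getting better at doing things, world models are getting better at rolling the future forward, and conventional computers are rebuilding their foundations for AI, will a new kind of Runtime emerge that unifies execution, rollout, and capability accumulation inside a single learning machine?

Structurally, this corresponds to a migration of the primary relationship between humans and machines. In conventional computing, people mainly interact with the computer. In the agent era, people interact more with the agent, which then calls the computer to get things done. The world model here is closer to a parallel prediction layer: it can serve the human or the agent, but it is not itself responsible for getting things done. Pushed further, what NC wants to change is the machine itself: it tries to gather the responsibilities now scattered across the computer, the agent, and the world model into one learning machine. At that point, people no longer just face "an agent calling a computer on their behalf," but use such a neural computer directly.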

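Figure (how the human-machine relation changes): In the past the relation looked more like Human → Computer; in the agent era it looks more like Human → Agent → Computer, with the World Model appearing mostly as a parallel prediction layer; if NC holds, people will face a Neural Computer more directly.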

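This also means that interaction itself takes on a flavor of "programming." Today, natural-language instructions, keyboard and mouse trajectories, screen changes, and task feedback are mostly just process logs; in the NC setting, they become material that shapes future behavior. Today we install capabilities mainly through code; in the future, demonstrations, interaction traces, and constraints may themselves become entry points through which capabilities enter the Runtime.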

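2. What is a Neural Computer, and what does it take for it to truly hold?

Start with a table: it measures conventional computers, agents, world models, and Neural Computers against the same "ruler." The table makes the differences and connections clear: what each is organized around, where its source of truth lies, and what responsibilities it carries.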

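Form	Organized around	Where the source of truth lies	Main responsibility
Conventional computer	Explicit programs	Explicit programs and explicit state	Stably execute explicit programs
Agent	Tasks	External environment, toolchain, and workflow	Complete tasks in an existing environment
World Model	Environment	State-evolution model	Predict and roll out environment changes
Neural Computer	Runtime	Capabilities and state inside the Runtime	Keep the machine running, accumulate capabilities, and govern updates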

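The table is intuitive enough that I will not repeat it. Instead, imagine directly: if NC already existed, how would people use it? With a conventional computer, you install software; with an agent, you describe a task; with an NC, what you do is closer to installing capabilities into the machine and expecting them to stay in the machine afterward.

For the same reason, the Runtime discussed here is not a particular software component, but the layer by which the system continues to be the same machine: what stays, what pushes the state forward, which inputs truly change the machine, and which changes already amount to rewriting the machine. For NC, the key is not to stack yet another layer of external tools, but whether capabilities and state can truly live inside the same learned runtime.

If it holds, what will the machine look like?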

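First, it may not keep growing along today's foundation-model route. The more natural idea today is to keep pushing models toward larger and stronger dense / MoE foundation models at the 1B to 10T scale, and much work is indeed advancing along that road. But in my view, once NC truly matures, the substrate is more likely to go the other way: 10T to 1000T scale, sparser, more addressable, with something of a circuit flavor. A future CNC may not be an ever-larger mass of continuous representations, but rather a machine substrate that is routable, composable, and easier to inspect locally. It need not imitate animal perception or the human brain; it may instead be closer to a neural network with a NAND flavor: discrete, sparse, and locally verifiable. For now this path has not been laid out systematically, but some recent OpenAI work on weight-sparse transformers at least shows one thing: making neural networks sparser, more local, and more routable is not necessarily only for interpretability; it may itself be a machine-style architectural route [16].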

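Second, it may not always upgrade itself by changing parameters wholesale. Today, a model is typically upgraded by retraining or fine-tuning its weights as a whole. NC points to a different mode of evolution: through the Runtime's self-programming and continuous interaction, the machine keeps evolving along its internal capability structure. User input no longer just triggers one-off behavior; it gradually installs, invokes, composes, and retains reusable neural routines, and may even form internal executors that can be called again later. At least in terms of functional division of labor, this is closer to the "memory" of a conventional computer than to its processor: upgrading need not mean rewriting the body of the whole machine; it may simply mean writing these new structures stably into a layer of addressable, callable, retainable internal state. Along this path, an upgrade is no longer just "swapping in a bigger model," but more like continually installing new parts inside the machine. NPI and HyperNetworks from some years ago can be seen as similar but incomplete early ideas: the former tried to decompose complex programs into callable, composable subprograms [17]; the latter suggested that a machine might even go on to generate downstream neural modules to extend the boundary of its own capabilities [18]. Of course, I think the ambition can be larger: a sufficiently strong Neural Computer could well generate new (sub-)NNs directly and mount them inside itself in a pluggable way, as naturally as installing or uninstalling software today, only this time without the intermediary of hand-written code and compilation.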

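Third, it may also gradually absorb world-model-style rollout into the Runtime. At that point, rollout slowly becomes a daily mechanism of the machine, and part of this self-programming and self-evolution. Humans can provide inputs and expected outputs (ground truth, GT), or only write the evaluation metrics in advance; even if nothing more is given in a particular round, the Runtime can keep running self-play, self-testing, filtering, and compressing candidate approaches internally, and then consolidate the effective improvements into the next capability update. Ideally, while people sleep, the machine is still evaluating, experimenting, and iterating internally. What truly remains is not just more context, but a change in the internal capability structure itself. All of this, of course, presupposes not that the system is left to change quietly, but that the update path itself can be governed.

Seen this way, the outline of NC as a machine form emerges. What matters is whether capabilities have really moved into the Runtime and are installed, reused, executed, and governed there. CNC describes what things look like once this has been achieved (the complete form). By the definition in the original paper, an NC instance counts as a CNC only if it satisfies four conditions simultaneously: it must be Turing complete, universally programmable, behavior-consistent unless explicitly reprogrammed, and must embody the architectural and programming semantics that distinguish NC from conventional computers. The table below is a plainer summary of these four requirements.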

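CNC condition	In plainer terms	What engineering should roughly show
Turing complete	It is not limited to a few fixed classes of tasks; in principle it has the expressive power of general computation.	But "expressible" does not mean "executable": what really matters is whether, as effective memory and context grow, the same NC can stably carry longer and more complex algorithmic processes, rather than failing in a new way as soon as the task gets longer.
Universally programmable	Inputs given to it should not merely trigger one-off behavior; they should be installable as routines or internal executors that can be called later.	Capabilities can be installed, invoked, composed, and retained, and reused stably across tasks once they enter the Runtime.
Behavior-consistent	Everyday use should not silently change the machine; behavioral changes may come only from explicit updates.	Behavior is reproducible within a version, execution and update traces are trackable, problems can be handled by replay / rollback, and long-horizon drift can be measured and governed.
Machine-native semantics	It does not merely use neural networks to imitate the old computer; it begins to form its own machine semantics and way of programming.	The neural substrate can, through composition, routing, continuous state, and internal execution structures, deliver capabilities the conventional stack is not good at; meanwhile, instructions, demonstrations, traces, and constraints themselves begin to serve as programming entry points, no longer only hand-written code.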
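
To make the "universally programmable" and "behavior-consistent" rows more tangible, here is a minimal sketch of the contract they describe. It is only an illustration under strong assumptions: `ToyRuntime`, `Version`, `install`, `run`, `rollback`, and `replay` are hypothetical names, and plain Python callables stand in for neural routines. It is not the mechanism used in the paper.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Tuple

Routine = Callable[[int], int]  # stand-in for an installed neural routine


@dataclass(frozen=True)
class Version:
    """An immutable snapshot of the routines installed in the runtime."""
    routines: Tuple[Tuple[str, Routine], ...] = ()

    def lookup(self, name: str) -> Routine:
        return dict(self.routines)[name]


@dataclass
class ToyRuntime:
    """Hypothetical runtime: use never mutates it, only install() does."""
    history: List[Version] = field(default_factory=lambda: [Version()])
    trace: List[Tuple[int, str, int, int]] = field(default_factory=list)

    @property
    def version(self) -> int:
        return len(self.history) - 1

    def install(self, name: str, routine: Routine) -> int:
        """Explicit update: create a new version that contains the routine."""
        current = dict(self.history[-1].routines)
        current[name] = routine
        self.history.append(Version(tuple(current.items())))
        return self.version

    def run(self, name: str, x: int) -> int:
        """Everyday use: call a routine and log it, without changing state."""
        y = self.history[-1].lookup(name)(x)
        self.trace.append((self.version, name, x, y))
        return y

    def rollback(self) -> int:
        """Drop the latest version, keeping the empty base version."""
        if len(self.history) > 1:
            self.history.pop()
        return self.version

    def replay(self) -> bool:
        """Re-run traced calls on versions that still exist and compare."""
        return all(
            self.history[v].lookup(name)(x) == y
            for v, name, x, y in self.trace
            if v < len(self.history)
        )
```

In this toy, calling `run` any number of times leaves `version` unchanged, while `install` and `rollback` are the only paths that change behavior, which is the property the table asks real systems to measure and govern.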

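3. The prototype implemented in the paper: what it proves, and what is still missing

The Neural Computer cycle I am hoping for will probably take another three years. So compared with the Neural Computer I truly envision, the work in our paper is only a very early step. As of today, the most convenient unified carrier is still the kind of neural network built for video generation and world models; for putting pixels, actions, and temporal rollout into one end-to-end prototype first, they are also the fastest route. What we currently use them to validate is only part of the key capabilities of NC. They are more like a transitional implementation reference than the final structure of NC; truly reaching CNC will still require a more thorough, bottom-up rebuild.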

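3.1 First, CLIGen (General): a convincing "imitation game for computers"

First check whether terminal rendering holds up: color scheme, cursor, scrolling, TUI, and the overall sense of rhythm.

Let us first look at what the first group of experiments generates. Unless you look closely, the results already pass for the real thing. For CLIGen (General), what we see first is that video models can already make terminal rendering look realistic enough. Mainstream video models were never trained for this kind of text-dense computer scene that depends heavily on discrete layout; but after further training, the "Imitation Game for Computers" can indeed be pulled off.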

Neural Computer (CLIGen General 1): The user types the command CREATE TABLE posts (ID INTEGER), with the terminal displaying the command in a dark background with colored syntax highlighting, including green and yellow text, and the cursor moving character-by-character as the user types, with some corrections and backspacing along the way. The output shows the command being executed, with key words like CREATE and TABLE in distinct colors, and the filename posts appearing in the command line.

Neural Computer (CLIGen General 2): The terminal displays a series of ANSI escape code formatted texts with changing background and foreground colors, executing commands like \u001b[48;2;255;128;128;38;2;0;0;0m which set the background to a shade of pink and text to black, and printing numbered lists with colors. The output includes specific numbers, such as "1", "5", "7", and "9", in different colors, creating a visually dynamic and colorful display, but the exact username, hostname, and path are not specified in the provided terminal session content.

Neural Computer (CLIGen General 3): At the root@localhost:~# prompt, the user types the date command, which displays the current date and time in a plain text format as "2021. 10. 11. 22:47:43 KST", then begins typing the cat command.

Neural Computer (CLIGen General 4): The terminal displays progress bars, package names like pillow, notebook, and tzlocal, and version changes in green and red text. The output shows downloading and installing statuses, including percentages, for packages like smmap, tomli, and protobuf, with the terminal scrolling through the output rapidly.

Neural Computer (CLIGen General 5): At the unspecified username@hostname prompt, the terminal displays a partition editor with a disk image file named "sd.img" (128 MiB) and the user interacts with it, creating a new Linux partition from free space, with key output content showing partition details in a table format, including "sd.img1" and "sd.img2" with their respective sizes and types, and a new partition "sd.img3" with 55M size and Linux type (83). The terminal shows a mix of black and colored text, including blue and red, with a cursor that blinks and moves to different parts of the screen as the user navigates through the partition editor options, such as "New", "Quit", and "Write", with specific prompts like "Partition type: Linux (83)" and "Create new partition from free space".

Neural Computer (CLIGen General 6): The terminal displays a progress bar with the command output "Evaluating" and percentages from 60% to 85%, showing yellow progress bars with increasing completion, such as "│████████████████████▍ │" to "│████████████████████████▉ │", alongside item counts "24/40" to "34/40" and time estimates "0:00:20" to "0:00:07". The output includes specific item completion and estimated time remaining, with the yellow-colored progress bar indicating the evaluation progress.
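
As a side note on the second caption, the quoted escape sequence is a standard SGR "direct color" sequence: `48;2;R;G;B` sets the background and `38;2;R;G;B` sets the foreground. The small sketch below decodes it; the function name `parse_truecolor` is made up for illustration and handles only this 24-bit form.

```python
import re

# An SGR sequence is ESC [ <numbers separated by ';'> m
SGR = re.compile(r"\x1b\[([0-9;]*)m")


def parse_truecolor(seq: str) -> dict:
    """Extract 24-bit foreground/background colors from one SGR sequence."""
    match = SGR.fullmatch(seq)
    if match is None:
        raise ValueError("not an SGR sequence")
    params = [int(p) for p in match.group(1).split(";") if p]
    colors = {}
    i = 0
    while i < len(params):
        # 38 = foreground, 48 = background; a following 2 means "R;G;B"
        if params[i] in (38, 48) and params[i + 1 : i + 2] == [2]:
            key = "foreground" if params[i] == 38 else "background"
            colors[key] = tuple(params[i + 2 : i + 5])
            i += 5
        else:
            i += 1
    return colors


print(parse_truecolor("\x1b[48;2;255;128;128;38;2;0;0;0m"))
```

The decoded values, background (255, 128, 128) and foreground (0, 0, 0), agree with the caption's description of a pink background with black text.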

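What this group learns first is the outermost layer of the terminal: how colors change, how the cursor blinks, whether window proportions stay stable, how long logs scroll, and how full-screen TUIs, progress bars, and status bars appear. What holds up first is likewise the surface appearance and rhythm of the terminal. In the terms used earlier, what is learned first here is still the look of the Runtime.

Seen in the context of September 2025, this experiment was a pleasant surprise. With only about 1,100 hours of noisy terminal data, Wan2.1 [31], which originally understood almost nothing about computer interfaces and struggled to generate even slightly small text, was brought to the point of stably generating terminal displays, with a fairly substantial shallow alignment to common commands, echoes, and log shapes. For video generation, this kind of scene (text-dense, fast-changing, flickering, and almost devoid of natural motion) is among the hardest, yet the result clearly exceeded what many expected at the time. The data here was still general video from the terminal domain, with many styles and very mixed scenes. Once terminal rendering held up, it encouraged us to go after the harder parts of a computer: memory, reasoning, programming, and execution.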

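3.2 Next, REPL and Math: it no longer just "draws terminals"

Here we look at the harder structure of execution: input, Enter, echo, local editing, and state continuity.

After the initial terminal-rendering experiments, the more interesting question is whether the terminal can be tested as a local machine that actions drive reliably. When a command is typed, does the buffer advance? When Enter is pressed, does the echo follow? After a typo, a deletion, and a retype, does the state carry on? REPL and Math are really two facets of the same question here: has the model begun to grasp a little computer physics?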

Neural Computer (CLIGen Clean 1)
Sleep 200ms
Type "env | head -n 5"
Enter
Sleep 600ms
Hide

Neural Computer (CLIGen Clean 2)
Sleep 200ms
Type "date"
Enter
Sleep 300ms
Type "whoami"
Enter
Sleep 300ms

Neural Computer (CLIGen Clean 3)
Sleep 200ms
Type "date"
Enter
Sleep 300ms
Type "whomai"
Enter
Sleep 300ms
Type "whomai"
Enter
Sleep 300ms
Hide

Neural Computer (CLIGen Clean 4)
Sleep 200ms
Type "top"
Enter
Sleep 2s
Down 3
Sleep 600ms
Up 2
Hide

Neural Computer (CLIGen Clean 5)
Sleep 500ms
Type "echo $HOME"
Sleep 90ms
Enter
Sleep 1442ms
Hide

Neural Computer (CLIGen Clean 6)
Sleep 200ms
Type "id"
Enter
Sleep 400ms
Hide

Neural Computer (CLIGen Clean 7)
Sleep 200ms
Type "pwd"
Enter
Sleep 400ms
Hide

Neural Computer (CLIGen Clean 8)
Sleep 400ms
Type "python - <<'PY'"
Enter
Type "import time"
Enter
Type "for i in range(18):"
Enter
Type " print(f'Frame {i:02d} ::' + '>' * (i % 20))"
Enter
Type " time.sleep(0.2)"
Enter
Type "PY"
Enter
Sleep 4000ms
Hide

Neural Computer (CLIGen Clean 9)
Sleep 400ms
Type "seq 1 28 | paste -d',' - - - - | column -t -s',' | tee metrics_7x4.txt"
Enter
Sleep 2000ms
Hide

Neural Computer (CLIGen Clean 10)
Sleep 180ms
Type "echo History size: $HISTSIZE"
Sleep 120ms
Enter
Sleep 400ms
Type "cal"
Sleep 120ms
Enter
Sleep 400ms
Type "echo Home: $HOME"
Sleep 120ms
Enter
Sleep 400ms
Sleep 400ms
Hide

Neural Computer (CLIGen Clean 11)
Sleep 800ms
Sleep 180ms
Type "echo Learning shell basics"
Sleep 120ms
Enter
Sleep 400ms
Type "date +%Y-%m-%d"
Sleep 120ms
Enter
Sleep 400ms
Type "echo Login shell: $0"
Sleep 120ms
Enter
Sleep 400ms
Type "uname -r"
Sleep 120ms
Enter
Sleep 400ms
Sleep

Neural Computer (CLIGen Clean 12)
Sleep 200ms
Type "python"
Enter
Sleep 400ms
Type "5"
Enter
Sleep 400ms
Type "exit()"
Enter
Sleep 400ms
Hide

Neural Computer (CLIGen Clean 13)
Sleep 200ms
Type "python"
Enter
Sleep 1s
Type "10+15"
Enter
Sleep 800ms
Hide

Neural Computer (CLIGen Clean 14)
Sleep 200ms
Type "python"
Enter
Sleep 1s
Type "40/1"
Enter
Sleep 800ms
Hide

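Now the focus shifts to the causality of instruction execution. The training data for this group comes from cleaner, more repeatable scripted trajectories: we generated these terminal videos ourselves with scripts and a Docker environment, so that input, Enter, echo, errors, and local edits all happen in a more stable terminal environment.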
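
The action scripts listed above use a small command vocabulary (`Sleep`, `Type`, `Enter`, `Hide`, `Up`, `Down`). They resemble the tape format of the VHS terminal recorder, though the source does not name the tool, so treat that as an assumption. The sketch below shows one way such a script could be turned into a structured action sequence for conditioning or evaluation; `Action`, `parse_script`, `commands`, and `total_sleep_ms` are hypothetical helpers, not part of the paper's pipeline.

```python
import re
from typing import List, NamedTuple, Optional


class Action(NamedTuple):
    kind: str            # "Sleep", "Type", "Enter", "Hide", "Up", "Down"
    arg: Optional[str]   # duration, typed text, repeat count, or None


_DURATION = re.compile(r"(\d+(?:\.\d+)?)(ms|s)")


def parse_duration_ms(text: str) -> float:
    """Convert '200ms' or '2s' into milliseconds."""
    match = _DURATION.fullmatch(text)
    if match is None:
        raise ValueError(f"bad duration: {text!r}")
    value = float(match.group(1))
    return value * 1000 if match.group(2) == "s" else value


def parse_script(script: str) -> List[Action]:
    """Parse one action script, one command per non-empty line."""
    actions = []
    for raw in script.splitlines():
        line = raw.strip()
        if not line:
            continue
        kind, _, rest = line.partition(" ")
        rest = rest.strip()
        if kind == "Type":
            if len(rest) < 2 or rest[0] != '"' or rest[-1] != '"':
                raise ValueError(f"Type needs a quoted string: {line!r}")
            actions.append(Action("Type", rest[1:-1]))
        elif kind == "Sleep":
            parse_duration_ms(rest)  # validate only; keep the original text
            actions.append(Action("Sleep", rest))
        elif kind in ("Enter", "Hide"):
            actions.append(Action(kind, None))
        elif kind in ("Up", "Down"):
            actions.append(Action(kind, rest or "1"))
        else:
            raise ValueError(f"unknown command: {line!r}")
    return actions


def commands(actions: List[Action]) -> List[str]:
    """Collect the text submitted by each Enter keypress."""
    submitted, buffer = [], ""
    for action in actions:
        if action.kind == "Type":
            buffer += action.arg
        elif action.kind == "Enter":
            submitted.append(buffer)
            buffer = ""
    return submitted


def total_sleep_ms(actions: List[Action]) -> float:
    return sum(parse_duration_ms(a.arg) for a in actions if a.kind == "Sleep")
```

For the CLIGen Clean 2 script, `commands(parse_script(...))` yields the two submitted commands `date` and `whoami`, which is the minimal ground truth one would compare a generated rollout against.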

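These results already show that the model has learned some of the most basic operating regularities of a computer terminal. For very simple commands such as pwd, date, whoami, echo $HOME, and env | head -n 5, the input, Enter, echo, and result display are already quite close to the real thing; the shape of the output for each command also matches the corresponding terminal scene. Compared with the previous experiments, the instruction itself can now drive character updates, echo generation, and local state changes, and the terminal unfolds according to its own way of running.

Pushing further along this line, the model has also picked up something in simple math scenarios, but reasoning itself has not really been solved. At the most basic arithmetic level, such as two-digit addition, the current model still struggles to compute correctly in a stable way. Data volume is certainly part of it: we have not yet given the model enough, and hard enough, training data to force out stable reasoning. But there is another, more fundamental possibility: using today's DiT-based video models to carry stable reasoning may itself be an ill-posed goal. The safer judgment for now is that the terminal execution layer is beginning to stand, while the symbolic reasoning layer has not yet passed.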
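
One way to make "stably computes two-digit addition" measurable is an exact-match probe. The sketch below is only an illustration under assumptions: it generates addition prompts in the same script style as the REPL examples and scores predicted answers by exact string match. How the predicted string is read back from generated frames (for example via OCR) is out of scope here, and all function names are hypothetical.

```python
import random
from typing import List, Sequence, Tuple


def make_addition_probes(n: int, seed: int = 0) -> List[Tuple[str, str]]:
    """Return (expression, expected REPL output) pairs for two-digit sums."""
    rng = random.Random(seed)  # fixed seed keeps the probe set repeatable
    probes = []
    for _ in range(n):
        a, b = rng.randint(10, 99), rng.randint(10, 99)
        probes.append((f"{a}+{b}", str(a + b)))
    return probes


def probe_to_script(expr: str) -> str:
    """Wrap an expression in a Python REPL action script."""
    return "\n".join([
        "Sleep 200ms",
        'Type "python"',
        "Enter",
        "Sleep 1s",
        f'Type "{expr}"',
        "Enter",
        "Sleep 800ms",
        "Hide",
    ])


def exact_match_accuracy(expected: Sequence[str], predicted: Sequence[str]) -> float:
    """Fraction of probes whose predicted text equals the expected answer."""
    if len(expected) != len(predicted):
        raise ValueError("expected and predicted must have the same length")
    if not expected:
        return 0.0
    hits = sum(e == p.strip() for e, p in zip(expected, predicted))
    return hits / len(expected)
```

Exact match is deliberately strict: an output that looks like a plausible number but differs in one digit counts as a failure, which is the distinction between rendering a result and computing it.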

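3.3 Next, GUIWorld: interface control is also starting to hold

Finally, check whether actions truly drive interface state: whether clicks, hovers, typing, and window feedback close the loop.

In the CLI stage, we established that video models render very well and that basic memory and execution abilities are beginning to appear, but the lowest-level symbolic reasoning is still not good enough. With GUIWorld, the focus changes again: is the interface state pushed along by actions?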

Conventional Computer (GT)	Neural Computer (Generation)
"thinking": "Now I'll click on the Term icon to open the terminal:" "action": "left_click", "x": 82, "y": 520	"thinking": "Now I'll click on the Term icon to open the terminal:" "action": "left_click", "x": 82, "y": 520
"thinking": "\"Now I'll type in 768 for the height:" "action": "type", "text": "512" "thinking": "Now I'll click the OK button to create the new image:" "action": "left_click", "x": 763, "y": 514	"thinking": "\"Now I'll type in 768 for the height:" "action": "type", "text": "512" "thinking": "Now I'll click the OK button to create the new image:" "action": "left_click", "x": 763, "y": 514
"thinking": "I can see the Firefox icon on the desktop. Now I'll click on it to open Firefox:" "action": "double_click", "x": 224, "y": 70 "thinking": "I see Firefox has opened with a welcome page. Now I'll click on the address bar and navigate to Wikipedia:" "action": "double_click", "x": 487, "y": 81	"thinking": "I can see the Firefox icon on the desktop. Now I'll click on it to open Firefox:" "action": "double_click", "x": 224, "y": 70 "thinking": "I see Firefox has opened with a welcome page. Now I'll click on the address bar and navigate to Wikipedia:" "action": "double_click", "x": 487, "y": 81
"thinking": "Now I'll press Enter to execute the command:" "action": "type", "text": "cat ~/Desktop/datetime.txt"	"thinking": "Now I'll press Enter to execute the command:" "action": "type", "text": "cat ~/Desktop/datetime.txt"
"thinking": "I can see the border_photo.png file that we created with ffmpeg. I'll double-click on it to open it in GIMP:" "action": "double_click", "x": 310, "y": 143 "thinking": "Great! Now the image is open in GIMP. I can see it's a peachpuff colored image as requested. Now I'll add a border to it. In GIMP, I'll go to the \"ilters\" menu and look for a border option:" "action": "left_click", "x": 547, "y": 43	"thinking": "I can see the border_photo.png file that we created with ffmpeg. I'll double-click on it to open it in GIMP:" "action": "double_click", "x": 310, "y": 143 "thinking": "Great! Now the image is open in GIMP. I can see it's a peachpuff colored image as requested. Now I'll add a border to it. In GIMP, I'll go to the \"ilters\" menu and look for a border option:" "action": "left_click", "x": 547, "y": 43
"thinking": "I can see that the theme has changed to \"smooth\" and I can already see the tiles in the background have a different style. Now I need to click the \"Close\" button to apply these changes and close the preferences dialog:" "action": "left_click", "x": 629, "y": 423	"thinking": "I can see that the theme has changed to \"smooth\" and I can already see the tiles in the background have a different style. Now I need to click the \"Close\" button to apply these changes and close the preferences dialog:" "action": "left_click", "x": 629, "y": 423
"thinking": "I see Wikipedia appears in the suggestions. I'll click on it to go to Wikipedia:" "action": "left_click", "x": 240, "y": 150	"thinking": "I see Wikipedia appears in the suggestions. I'll click on it to go to Wikipedia:" "action": "left_click", "x": 240, "y": 150