← トップへ戻る

コミュニティ投稿 ·考察・分析 ·完成記事 ·AIによる読み解き

ブラウザ上で動作する画像修復モデル、Moebiusの移植がもたらす可能性とは？

Simon Willison氏が、WebGPUを使ってブラウザ上で動作する画像修復モデル Moebius の移植に成功

元記事タイトル: ブラウザ上で動作する画像修復モデル Moebius の移植

Simon Willison's Weblog 2026年06月22日

個人の見解・体験を含む可能性があります。公式発表ではないため、仕様変更や正式な発表内容は必ず元情報も確認してください。

ANALYSIS 考察・分析 / Opinion

Field Note 読む前に確認

3行まとめ

Moebiusは軽量な画像修復フレームワーク
WebGPUを用いてブラウザで実行可能に
非正方形の画像にも対応

こんな人に関係ある話

Pythonエンジニア画像編集ツール開発者 Web技術者

信頼度メモ

Simon Willison's Weblog の記事（個人またはコミュニティの解釈を含む）

記事の読み解き Reading

元記事を材料に、要点、編集視点、良い点と懸念点を読みやすい順に整理しています。

Simon Willison氏は、Hacker Newsで発見した軽量な画像修復フレームワーク「Moebius: 0.2B」についてブログ記事を書きました。このモデルは、画像の特定領域をマークして削除し、その部分を埋めるための技術を持っています。Willison氏は、元々PyTorchとNVIDIA CUDAが必要なこのモデルをWebGPUを使ってブラウザ上で動作させる移植に成功しました。デモ版は「simonw.github.io/moebius-web/」で試すことができます。

編集部コメント

この記事は、AI技術の実装とブラウザベースでの動作を組み合わせた革新的なアプローチを示しています。WebGPUを通じて画像修復モデルがブラウザ上で動作するようになることで、ユーザーインターフェースやデプロイメントの柔軟性が向上します。

評価ポイント Assessment

良い点

画像修復技術の進歩
WebGPUによるブラウザでの実行
非正方形画像への対応

業界・社会への影響 Impact

この移植は、画像編集や修正をより簡単に、そして幅広いユーザーに提供する可能性を持っています。また、WebGPUの活用により、デスクトップアプリケーションに頼らずとも高度な画像処理が可能になり、モバイルデバイスでの利用も期待されます。

深堀り Deep Dive

前提知識

Moebiusは、画像修復や領域の削除と埋め込みを行うための軽量フレームワークです。元々はPyTorchとNVIDIA CUDAという特定のハードウェアとソフトウェア環境が必要でしたが、これにより深層学習モデルがブラウザ上で動作するようになる移植技術が開発されました。

何が新しいのか

この記事では、MoebiusをWebGPUを使用してブラウザ内で実行可能に移植したという新しい取り組みについて説明しています。これにより、従来は高性能なハードウェアが必要だった画像修復処理が、一般的なPCやモバイルデバイスのブラウザ上で低コストで簡単に利用できるようになりました。

用語解説

ポーティング(porting) あるシステムで動作するソフトウェアを異なる仕様や設計のシステムに適応させること

WebGPU ウェブブラウザ上で高性能なグラフィックス処理や計算を行うためのAPI

画像修復モデル欠損した部分のある画像からオリジナルの状態に近い形で補完する深層学習モデル

参照元 Sources

元記事と、深堀りで参照した情報源です。コミュニティ投稿やプレプリントでは、ここから根拠を確認できます。

ブラウザ上で動作する画像修復モデル Moebius の移植

Simon Willison's Weblog

https://simonwillison.net/2026/Jun/22/porting-moebius/#atom-everything

ポーティング（移植）とは - IT用語辞典 e-Words https://e-words.jp/w/%E3%83%9D%E3%83%BC%E3%83%86%E3%82%A3%E3%83%B3%E3%82%B0.html used in analysis

IT用語『porting』の意味とは？ https://it-notes.stylemap.co.jp/programs/what-is-porting-in-it-terminology/ used in analysis

ポーティングとは？ソフトウェアの移植と対応環境の解説 - LexiWord https://words.af-e.net/porting/

この記事の見取り図

読む前に確認
記事の読み解き
深堀り
参照元
AI要約について
関連記事

キーワード

Moebius WebGPU PyTorch NVIDIA CUDA 画像修復

AI要約について

本記事の要約・分類・読み解きにはAIを使用しています。内容確認に努めていますが、誤訳・解釈違い・元記事更新の反映漏れを含む可能性があります。重要な判断を行う場合は、必ず元記事もご確認ください。

記事データ

Source	コミュニティ投稿
Category	考察・分析
Status	完成記事
出典	Simon Willison's Weblog
公開日	2026-06-22

元記事の説明文

This morning <a href="https://news.ycombinator.com/item?id=48630171">on Hacker News</a> I saw <a href="https://hustvl.github.io/Moebius/">Moebius: 0.2B Lightweight Image Inpainting Framework with 10B-Level Performance</a>, describing a small but effective inpainting model - a model where you can mark regions of an image to remove and the model imagines what should fill the space. The released model <a href="https://github.com/hustvl/Moebius/blob/9310b76e368f5f7a8ecdf06493231af279c9973b/requirements.txt#L1">required PyTorch and NVIDIA CUDA</a>, but since it described itself as 0.2B I decided to try and get it running using WebGPU in a browser. TL;DR: I got it working, and you can try the demo at <a href="https://simonw.github.io/moebius-web/">simonw.github.io/moebius-web/</a>. Read on for the details. <h4 id="the-finished-tool">The finished tool</h4> Here's a video demo of the finished tool: <video controls="controls" height="1070" poster="https://static.simonwillison.net/static/2026/inpainting_1280_poster.jpg" preload="none" style="height: auto;" width="1280"> <source src="https://static.simonwillison.net/static/2026/inpainting_1280.mp4" type="video/mp4" /> </video> You can open any image in it (non-square images get letterboxed), highlight areas to remove, click the "Run inpaint" button and wait for the model to do its magic. <h4 id="a-parallel-agent-side-project">A parallel agent side-project</h4> My main project for today was landing a major feature in Datasette: a UI for creating and altering tables, as a follow-up to the <a href="https://simonwillison.net/2026/Jun/16/datasette/">insert and edit rows feature</a> I released last week. I was working on that in Codex Desktop (here's <a href="https://github.com/simonw/datasette/pull/2789">the PR</a>) and often found myself spending 5-10 minutes spinning my fingers waiting for it to complete a mid-sized refactor or add the finishing touches to a change to the UI. (An amusing thing about coding agents is that the harder a problem is the more time you have to get distracted while you wait for them to finish crunching!) So I decided to spin up Claude Code in a terminal window and see how far I could get at porting Moebius to the web. <h4 id="some-agentic-research-to-kick-off-the-project">Some agentic research to kick off the project</h4> My first step was to ask regular Claude about the feasibility of this project. In <a href="https://claude.ai/">Claude.ai</a>, which has the ability to clone repos from GitHub: <blockquote> <code>Clone [https://github.com/hustvl/Moebius/](https://github.com/hustvl/Moebius/) and tell me if they published the code and weights to run this model anywhere</code> </blockquote> (I hadn't spotted the link to the weights yet, that's tucked away in the "News" section.) Then: <blockquote> <code>For Moebius what are the options for running it right now - Python and NVIDIA CUDA only or other options too?</code> </blockquote> And: <blockquote> <code>Muse on the feasibility of porting it to Transformers.js or similar and running it in a browser</code> </blockquote> I like telling models to "muse on X", it's the shortest way I've found of expressing that I want them to contemplate a problem for me without providing them with a concrete goal. Here's <a href="https://claude.ai/share/551c3dc8-17ce-4a4b-a0c9-8cbded6c7bf1">that chat transcript</a>. I copied out the last answer and saved it as <a href="https://github.com/simonw/moebius-web/blob/main/research.md">research.md</a> for Claude Code to read later. Claude suggested using ONNX Runtime Web on the WebGPU backend - the layer below the <a href="https://huggingface.co/docs/transformers.js/en/index">Transformers.js</a> library I had suggested. That was enough to convince me it was worth setting Claude Code loose and seeing how far it could get. I usually start projects like this by gathering as much information as the coding agent might need as possible. Since I didn't expect this project to actually work I did everything in my <code>/tmp</code> folder: <div class="highlight highlight-source-shell"><pre>cd /tmp mkdir Moebius cd Moebius # Grab the Moebius python code git clone https://github.com/hustvl/Moebius # And the model weights (Claude figured this out): GIT_LFS_SKIP_SMUDGE=0 git clone \ https://huggingface.co/hustvl/Moebius Moebius-weights # Finally a couple of libraries we might use: git clone https://github.com/huggingface/transformers.js git clone https://github.com/microsoft/onnxruntime</pre></div> <h4 id="setting-off-claude-code">Setting off Claude Code</h4> I created a directory for the rest of the project and ran <code>git init</code> in that so Claude could start committing code notes: <div class="highlight highlight-source-shell"><pre>mkdir /tmp/Moebius/moebius-web cd /tmp/Moebius/moebius-web git init # Copy in that research.md from earlier git add research.md git commit -m "Initial research by Claude Opus 4.8"</pre></div> I fired up a <code>claude</code> instance in the <code>/tmp/Moebius</code> folder, the level above all of the research materials I had prepared for it. I prompted: <blockquote> <code>Read ./moebius-web/research.md - your goal is to port this model to ONNX and WebGPU so we can run it directly in a browser, with a simple UI</code> </blockquote> As it started to work I dropped in this follow-up (typos included): <blockquote> <code>Bulid this in /tmp/Moebius/moebius-web and commit early and often, also maintain a notes.md file in there with notes about what you figure out along the way - also start by writing out a plan.md in there and update that plan as oy work too</code> </blockquote> I often ask agents to keep notes like this - the end result is often interesting, both for myself and for the next agent session that touches the same project. Here's what that <a href="https://github.com/simonw/moebius-web/blob/main/notes.md">notes.md file</a> looked like at the end of the project. I kicked it off and went back to my main project, checking in occasionally to see how Claude was doing. When it looked like it might have something that worked I prompted: <blockquote> <code>Tell me what URL I can visit in my own browser to try this</code> </blockquote> Then I tried it out in Chrome and pasted some errors (and screenshots of errors) back into Claude Code. After a few rounds of this we had something that appeared to work! Time to put it on the internet so other people could use it. <blockquote> <code>How would we publish this to Hugging Face such that the model weights were on there and the HTML demo would show up in Hugging Face spaces?</code> </blockquote> Claude Code knows how to use the <code>hf</code> CLI tool, so I created a model repo on <a href="https://huggingface.co/">Hugging Face</a>, then <a href="https://huggingface.co/settings/tokens">created a token</a> that could write to that repo and dropped it into a <code>/tmp/Moebius/token.txt</code> file so Claude could use it. It published the 1.24GB of converted ONNX weights to <a href="https://huggingface.co/simonw/Moebius-ONNX">huggingface.co/simonw/Moebius-ONNX</a> for me. I'd seen other demos load weights into the browser from Hugging Face before, so I knew it was possible. I decided to host my own frontend code on GitHub Pages, so I said: <blockquote> <code>I want to publish the moebius-web folder to GitHub, minus the large files (so maybe minus the models/ folder), such that when I turn on GitHub Pages for that repo navigating to https://simonw.github.io/moebius-web/ serves the UI</code> </blockquote> Telling it the final URL was important in case it needed to fix the URLs in the demos that it was building so they would work when deployed to production. After a few more rounds of iteration, in between working on my main project, we got to a working, deployed version! Except... each time I reloaded the page it seemed to download ~1.3GB of model weights. Browser caching seemed pretty important for this! <blockquote> <code>anything clever we can do with serviceworkers or similar to help cache this stuff? It seems to reload every time, I am concerned that there might be something weird about the way HF redirects work that mean we don't benefit from browser caching</code> </blockquote> I knew that Transformers.js projects could handle this properly, so I grabbed a copy of the <a href="https://huggingface.co/spaces/Xenova/whisper-web">Whisper Web</a> demo, dropped it into <code>/tmp/Moebius/whisper-web</code> and said: <blockquote> <code>look in /tmp/Moebius/whisper-web (with a subagent) and see how they do this</code> </blockquote> That project was entirely obfuscated, built JavaScript files so I figured using a subagent would avoid spending the rest of my top-level token context deciphering those files. Claude figured out that it was using <code>caches.open("transformers-cache")</code> - the <a href="https://developer.mozilla.org/en-US/docs/Web/API/CacheStorage/open">CacheStorage API</a> - and <a href="https://github.com/simonw/moebius-web/commit/05c1cbc4894460a70a8bc1718ac6d152219e0f28#diff-fb89c342dfa36f544a2d16a885b0f3d1d49f436a7d0eaeb80505f80a1f922603">added that to our project</a>. I've shared the <a href="https://gisthost.github.io/?58039ba5c1ca3ed177e8659168996ee4">full Claude Code transcript</a> for this project (published using my <a href="https://github.com/simonw/claude-code-transcripts">claude-code-transcripts</a> tool). <h4 id="what-did-i-learn-from-all-of-this-">What did I learn from all of this?</h4> This definitely counts as vibe coding: I didn't look at a single line of code from the project, restricting my input to testing, suggesting small feature improvements (like a progress bar for the large file downloads) and pointing the model in the direction of examples of how I wanted things to work. Since I didn't write any code the amount I learned about the underlying technologies - WebGPU, ONNX, and the Moebius model itself - was very limited. As is usually the case with this kind of project the most important things I learned concerned what was possible: <ul> <li>Claude Opus 4.8 is capable of converting a PyTorch model to ONNX, publishing the result to Hugging Face and then building out a web application and interface that can load and execute that model.</li> <li>Chrome, Firefox and Safari are all now capable of running this kind of model - I tried it in all three.</li> <li>The CacheStorage API works with ~1.3GB model files.</li> <li>... which means we can have inpainting as a feature of a client-only web application! (If our users can tolerate the 1.3GB download.)</li> </ul> I felt like I should probably try and learn a little more about my project. I fired up <a href="https://claude.ai/">Claude.ai</a> and prompted: <blockquote> <code>Clone [https://github.com/simonw/moebius-web/](https://github.com/simonw/moebius-web/) and use it to teach me all about the model and ONNX and the process of converting a model to ONNX and WebGPU and basically everything I'd need to know in order to fully understand this repo</code> </blockquote> Here's <a href="https://claude.ai/share/d11b8f2b-a52d-4ca2-be75-a710eaf18572">the transcript</a> and the <a href="https://github.com/simonw/moebius-web/blob/main/understanding.md">understanding.md</a> Markdown file it created, which I've now added to the GitHub repo. I found the explanation of ONNX particularly enlightening: <blockquote> ONNX (Open Neural Network Exchange) is a portable, framework-neutral file format for neural networks. An <code>.onnx</code> file is essentially two things bundled together: <ol> <li> A computation graph — a directed graph of nodes, where each node is an operator (<code>Conv</code>, <code>MatMul</code>, <code>Add</code>, <code>Einsum</code>, <code>Softmax</code>, <code>Gather</code>, <code>Resize</code>, …) wired together by named tensors flowing between them. This is the "recipe" for the forward pass.</li> <li> The weights — the learned parameter tensors (the convolution kernels, the embedding table, etc.), stored as initializers in that same graph.</li> </ol> Crucially, ONNX describes what to compute, abstractly, without saying how or on what hardware. The operator set is versioned by an opset number (this repo uses opset 18), which pins down exactly which operators exist and what their semantics are. </blockquote> It turns out PyTorch has built in mechanisms for exporting to ONNX, as seen <a href="https://github.com/simonw/moebius-web/blob/080be6e737ec976130e260d34707d7d9b7f63d5b/python/export_onnx.py#L91">here in export_onnx.py</a>: <pre>torch.onnx.export( dec, (lat,), dec_path, opset_version=args.opset, input_names=["latent"], output_names=["image"], dynamic_axes={"latent": {0: "B"}, "image": {0: "B"}}, )</pre> It also included a <a href="https://github.com/simonw/moebius-web/blob/main/understanding.md#12-mini-glossary">handy glossary</a> and an only-slightly-broken <a href="https://github.com/simonw/moebius-web/blob/main/understanding.md#10-putting-the-whole-pipeline-in-one-picture">ASCII-art diagram</a> showing how the model pipeline fits together. Tags: <a href="https://simonwillison.net/tags/browsers">browsers</a>, <a href="https://simonwillison.net/tags/transformers-js">transformers-js</a>, <a href="https://simonwillison.net/tags/webgl">webgl</a>, <a href="https://simonwillison.net/tags/vibe-coding">vibe-coding</a>, <a href="https://simonwillison.net/tags/coding-agents">coding-agents</a>, <a href="https://simonwillison.net/tags/claude-code">claude-code</a>, <a href="https://simonwillison.net/tags/onnx">onnx</a>