Extraction Gym Report

Generated 2026-03-21 16:19:47 — extraction — 6 sites evaluated

6
Total Sites
4
OK (≥0.8)
1
WARN (≥0.5)
1
FAIL (<0.5)
0.81
Avg Overall

Score Matrix

Site Category Overall Intrusion Anchor Must-not Timing
substack_mauboussin newsletter 0.35 (FAIL) 0.00 0.00 0.00 567ms
medium_article blog 0.50 (WARN) 0.00 0.00 0.00 178ms
reuters_article news 1.00 (OK) 0.00 0.00 0.00 2,684ms
python_docs docs 1.00 (OK) 0.00 0.00 0.00 3,405ms
paulgraham blog 1.00 (OK) 0.00 0.00 0.00 4,622ms
substack_waxman newsletter 1.00 (OK) 0.00 0.00 0.00 9,747ms

Category Breakdown

Category Sites Avg Overall Avg Intrusion Avg Anchor Avg Must-not
blog 2 0.75 0.00 0.00 0.00
docs 1 1.00 0.00 0.00 0.00
news 1 1.00 0.00 0.00 0.00
newsletter 2 0.68 0.00 0.00 0.00

Site Details

FAIL substack_mauboussin newsletter 0.35 🔗
Overall
0.35
Intrusion
0.00
Anchor
0.00
Must-not
0.00
Chars: 122,556 raw → 76 extracted → 76 cleaned (0% of raw)
Fetch
522ms
Extract
45ms
Clean
0ms
Text preview (extracted vs cleaned)

Extracted (76 chars)

Michael Mauboussin | Substack

SubscribeSign in

## Page not found

No posts

Cleaned (76 chars)

Michael Mauboussin | Substack

SubscribeSign in

## Page not found

No posts
WARN medium_article blog 0.50 🔗
Overall
0.50
Intrusion
0.00
Anchor
0.00
Must-not
0.00
Chars: 9,909 raw → 61 extracted → 61 cleaned (1% of raw)
Fetch
176ms
Extract
2ms
Clean
0ms
Text preview (extracted vs cleaned)

Extracted (61 chars)

# Just a moment...

Enable JavaScript and cookies to continue

Cleaned (61 chars)

# Just a moment...

Enable JavaScript and cookies to continue
OK reuters_article news 1.00 🔗
Overall
1.00
Intrusion
0.00
Anchor
0.00
Must-not
0.00
Chars: 591,790 raw → 2,235 extracted → 2,235 cleaned (0% of raw)
Fetch
1ms
Extract
62ms
Clean
2,621ms
Text preview (extracted vs cleaned)

Extracted (2,235 chars)

**Published:** 2026-03-19T21:14:50.941000+00:00

* March 19, 2026

  [As OpenClaw enthusiasm grips China, schoolkids and retirees alike raise 'lobsters'](/technology/openclaw-enthusiasm-grips-china-schoolkids-retirees-alike-raise-lobsters-2026-03-19/)

  The AI ‌agent, which can connect several hardware and software tools and learn from the data produced with much less human intervention than a chatbot, has gone viral in China.

  [![Setup session for AutoClaw, a local version of the AI agent OpenClaw developed by Zhipu, in Beijing](https://www.reuters.com/resizer/v2/Q22722PP5RMZDE3LHNPJUMV6TI.jpg?auth=cac68656a34748429d90ace6bf62465904fc603a5440f7da844e1936c563685b&width=1200&quality=80)](/technology/openclaw-enthusiasm-grips-china-schoolkids-retirees-alike-raise-lobsters-2026-03-19/)
* March 20, 2026

  [US charges three tied to Super Micro Computer with helping smuggle billions of dollars of AI chips to China](/world/us-charges-three-people-with-conspiring-divert-ai-tech-china-2026-03-19/)
* March 19, 2026

  [Nvidia to sell 1 million chips to Amazon by end of 2027 in cloud deal](/business/retail-consumer/nvidia-sell-1-million-chips-amazon-by-end-2027-cloud-deal-2026-03-19/)
* March 19, 2026

  [Hegseth wants Pentagon to dump Anthropic's Claude, but military users say it's not so easy](/business/hegseth-wants-pentagon-dump-anthropics-claude-military-users-say-its-not-so-easy-2026-03-19/)
* ago

  [OpenAI to nearly double workforce to 8,000 by end-2026, FT reports](/business/openai-nearly-double-workforce-8000-by-end-2026-ft-reports-2026-03-21/)

---

## Reuters Events Logo

* Event

  [Momentum AI New York, opens new tab](https://events.reutersevents.com/momentum/nyc?utm_source=reutersAIpage)

  27-28 Apr 2026 · New York, USA
* Event

  [Momentum AI London, opens new tab](https://events.reutersevents.com/momentum/london?utm_source=reutersAIpage)

  29-30 Jun 2026 · London, USA
* Event

  [Momentum AI Austin, opens new tab](https://events.reutersevents.com/momentum?utm_source=reutersAIpage)

  24-25 Sep 2026 · Austin, USA
* Event

  [Momentum AI Finance, opens new tab](https://events.reutersevents.com/momentum/finance?utm_source=reutersAIpage)

  16-17 Nov 2026 · New York, USA

## More Coverage

Cleaned (2,235 chars)

**Published:** 2026-03-19T21:14:50.941000+00:00

* March 19, 2026

  [As OpenClaw enthusiasm grips China, schoolkids and retirees alike raise 'lobsters'](/technology/openclaw-enthusiasm-grips-china-schoolkids-retirees-alike-raise-lobsters-2026-03-19/)

  The AI ‌agent, which can connect several hardware and software tools and learn from the data produced with much less human intervention than a chatbot, has gone viral in China.

  [![Setup session for AutoClaw, a local version of the AI agent OpenClaw developed by Zhipu, in Beijing](https://www.reuters.com/resizer/v2/Q22722PP5RMZDE3LHNPJUMV6TI.jpg?auth=cac68656a34748429d90ace6bf62465904fc603a5440f7da844e1936c563685b&width=1200&quality=80)](/technology/openclaw-enthusiasm-grips-china-schoolkids-retirees-alike-raise-lobsters-2026-03-19/)
* March 20, 2026

  [US charges three tied to Super Micro Computer with helping smuggle billions of dollars of AI chips to China](/world/us-charges-three-people-with-conspiring-divert-ai-tech-china-2026-03-19/)
* March 19, 2026

  [Nvidia to sell 1 million chips to Amazon by end of 2027 in cloud deal](/business/retail-consumer/nvidia-sell-1-million-chips-amazon-by-end-2027-cloud-deal-2026-03-19/)
* March 19, 2026

  [Hegseth wants Pentagon to dump Anthropic's Claude, but military users say it's not so easy](/business/hegseth-wants-pentagon-dump-anthropics-claude-military-users-say-its-not-so-easy-2026-03-19/)
* ago

  [OpenAI to nearly double workforce to 8,000 by end-2026, FT reports](/business/openai-nearly-double-workforce-8000-by-end-2026-ft-reports-2026-03-21/)

---

## Reuters Events Logo

* Event

  [Momentum AI New York, opens new tab](https://events.reutersevents.com/momentum/nyc?utm_source=reutersAIpage)

  27-28 Apr 2026 · New York, USA
* Event

  [Momentum AI London, opens new tab](https://events.reutersevents.com/momentum/london?utm_source=reutersAIpage)

  29-30 Jun 2026 · London, USA
* Event

  [Momentum AI Austin, opens new tab](https://events.reutersevents.com/momentum?utm_source=reutersAIpage)

  24-25 Sep 2026 · Austin, USA
* Event

  [Momentum AI Finance, opens new tab](https://events.reutersevents.com/momentum/finance?utm_source=reutersAIpage)

  16-17 Nov 2026 · New York, USA

## More Coverage
OK python_docs docs 1.00 🔗
Overall
1.00
Intrusion
0.00
Anchor
0.00
Must-not
0.00
Chars: 24,565 raw → 3,746 extracted → 2,473 cleaned (10% of raw)
Fetch
0ms
Extract
41ms
Clean
3,364ms
Text preview (extracted vs cleaned)

Extracted (3,746 chars)

asyncio — Asynchronous I/O — Python 3.14.3 documentation

# `asyncio` — Asynchronous I/O[¶](#module-asyncio "Link to this heading")

---

asyncio is a library to write **concurrent** code using
the **async/await** syntax.

asyncio is used as a foundation for multiple Python asynchronous
frameworks that provide high-performance network and web-servers,
database connection libraries, distributed task queues, etc.

asyncio is often a perfect fit for IO-bound and high-level
**structured** network code.

See also

[A Conceptual Overview of asyncio](../howto/a-conceptual-overview-of-asyncio.html#a-conceptual-overview-of-asyncio)
:   Explanation of the fundamentals of asyncio.

asyncio provides a set of **high-level** APIs to:

* [run Python coroutines](asyncio-task.html#coroutine) concurrently and
  have full control over their execution;
* perform [network IO and IPC](asyncio-stream.html#asyncio-streams);
* control [subprocesses](asyncio-subprocess.html#asyncio-subprocess);
* distribute tasks via [queues](asyncio-queue.html#asyncio-queues);
* [synchronize](asyncio-sync.html#asyncio-sync) concurrent code;

Additionally, there are **low-level** APIs for
*library and framework developers* to:

* create and manage [event loops](asyncio-eventloop.html#asyncio-event-loop), which
  provide asynchronous APIs for [networking](asyncio-eventloop.html#loop-create-server),
  running [subprocesses](asyncio-eventloop.html#loop-subprocess-exec),
  handling [OS signals](asyncio-eventloop.html#loop-add-signal-handler), etc;
* implement efficient protocols using
  [transports](asyncio-protocol.html#asyncio-transports-protocols);
* [bridge](asyncio-future.html#asyncio-futures) callback-based libraries and code
  with async/await syntax.

[Availability](intro.html#availability): not WASI.

This module does not work or is not available on WebAssembly. See
[WebAssembly platforms](intro.html#wasm-availability) for more information.

asyncio REPL

You can experiment with an `asyncio` concurrent context in the [REPL](../glossary.html#term-REPL):

```
$ python -m asyncio
asyncio REPL ...
Use "await" directly instead of "asyncio.run()".
Type "help", "copyright", "credits" or "license" for more information.
>>> import asyncio
>>> await asyncio.sleep(10, result='hello')
'hello'
```

This REPL provides limited compatibility with [`PYTHON_BASIC_REPL`](../using/cmdline.html#envvar-PYTHON_BASIC_REPL).
It is recommended that the default REPL is used
for full functionality and the latest features.

Raises an [auditing event](sys.html#auditing) `cpython.run_stdin` with no arguments.

Changed in version 3.12.5: (also 3.11.10, 3.10.15, 3.9.20, and 3.8.20)
Emits audit events.

Changed in version 3.13: Uses PyREPL if possible, in which case [`PYTHONSTARTUP`](../using/cmdline.html#envvar-PYTHONSTARTUP) is
also executed. Emits audit events.

Reference

High-level APIs

* [Runners](asyncio-runner.html)
* [Coroutines and tasks](asyncio-task.html)
* [Streams](asyncio-stream.html)
* [Synchronizati

Cleaned (2,473 chars)

asyncio — Asynchronous I/O — Python 3.14.3 documentation

# `asyncio` — Asynchronous I/O

asyncio is a library to write **concurrent** code using
the **async/await** syntax.

asyncio is used as a foundation for multiple Python asynchronous
frameworks that provide high-performance network and web-servers,
database connection libraries, distributed task queues, etc.

asyncio is often a perfect fit for IO-bound and high-level
**structured** network code.

See also

A Conceptual Overview of asyncio
:   Explanation of the fundamentals of asyncio.

asyncio provides a set of **high-level** APIs to:

* run Python coroutines concurrently and
  have full control over their execution;
* perform network IO and IPC;
* control subprocesses;
* distribute tasks via queues;
* synchronize concurrent code;

Additionally, there are **low-level** APIs for
*library and framework developers* to:

* create and manage event loops, which
  provide asynchronous APIs for networking,
  running subprocesses,
  handling OS signals, etc;
* implement efficient protocols using
  transports;
* bridge callback-based libraries and code
  with async/await syntax.

Availability: not WASI.

This module does not work or is not available on WebAssembly. See
WebAssembly platforms for more information.

asyncio REPL

You can experiment with an `asyncio` concurrent context in the REPL:

```
$ python -m asyncio
asyncio REPL ...
Use "await" directly instead of "asyncio.run()".
Type "help", "copyright", "credits" or "license" for more information.
>>> import asyncio
>>> await asyncio.sleep(10, result='hello')
'hello'
```

This REPL provides limited compatibility with `PYTHON_BASIC_REPL`.
It is recommended that the default REPL is used
for full functionality and the latest features.

Raises an auditing event `cpython.run_stdin` with no arguments.

Changed in version 3.12.5: (also 3.11.10, 3.10.15, 3.9.20, and 3.8.20)
Emits audit events.

Changed in version 3.13: Uses PyREPL if possible, in which case `PYTHONSTARTUP` is
also executed. Emits audit events.

Reference

High-level APIs

* Runners
* Coroutines and tasks
* Streams
* Synchronization Primitives
* Subprocesses
* Queues
* Exceptions
* Call Graph Introspection

Low-level APIs

* Event loop
* Futures
* Transports and Protocols
* Policies
* Platform Support
* Extending

Guides and Tutorials

* High-level API Index
* Low-level API Index
* Developing with asyncio

Note

The source code for asyncio can be found in Lib/asyncio/.
OK paulgraham blog 1.00 🔗
Overall
1.00
Intrusion
0.00
Anchor
0.00
Must-not
0.00
Chars: 9,280 raw → 3,249 extracted → 3,058 cleaned (33% of raw)
Fetch
0ms
Extract
10ms
Clean
4,612ms
Text preview (extracted vs cleaned)

Extracted (3,249 chars)

# Writes and Write-Nots

|  |  |  |  |
| --- | --- | --- | --- |
|  |  | |  | | --- | | Writes and Write-Nots  October 2024  I'm usually reluctant to make predictions about technology, but I feel fairly confident about this one: in a couple decades there won't be many people who can write.  One of the strangest things you learn if you're a writer is how many people have trouble writing. Doctors know how many people have a mole they're worried about; people who are good at setting up computers know how many people aren't; writers know how many people need help writing.  The reason so many people have trouble writing is that it's fundamentally difficult. To write well you have to think clearly, and thinking clearly is hard.  And yet writing pervades many jobs, and the more prestigious the job, the more writing it tends to require.  These two powerful opposing forces, the pervasive expectation of writing and the irreducible difficulty of doing it, create enormous pressure. This is why eminent professors often turn out to have resorted to plagiarism. The most striking thing to me about these cases is the pettiness of the thefts. The stuff they steal is usually the most mundane boilerplate — the sort of thing that anyone who was even halfway decent at writing could turn out with no effort at all. Which means they're not even halfway decent at writing.  Till recently there was no convenient escape valve for the pressure created by these opposing forces. You could pay someone to write for you, like JFK, or plagiarize, like MLK, but if you couldn't buy or steal words, you had to write them yourself. And as a result nearly everyone who was expected to write had to learn how.  Not anymore. AI has blown this world open. Almost all pressure to write has dissipated. You can have AI do it for you, both in school and at work.  The result will be a world divided into writes and write-nots. There will still be some people who can write. Some of us like it. But the middle ground between those who are good at writing and those who can't write at all will disappear. Instead of good writers, ok writers, and people who can't write, there will just be good writers and people who can't write.  Is that so bad? Isn't it common for skills to disappear when technology makes them obsolete? There aren't many blacksmiths left, and it doesn't seem to be a problem.  Yes, it's bad. The reason is something I mentioned earlier: writing is thinking. In fact there's a kind of thinking that can only be done by writing. You can't make this point better than Leslie Lamport did: If you're thinking without writing, you only think you're thinking. So a world divided into writes and write-nots is more dangerous than it sounds. It will be a world of thinks and think-nots. I know which half I want to be in, and I bet you do too.  This situation is not unprecedented. In preindustrial times most people's jobs made them strong. Now if you want to be strong, you work out. So there are still stron

Cleaned (3,058 chars)

# Writes and Write-Nots

October 2024

I'm usually reluctant to make predictions about technology, but I feel fairly confident about this one: in a couple decades there won't be many people who can write.

One of the strangest things you learn if you're a writer is how many people have trouble writing. Doctors know how many people have a mole they're worried about; people who are good at setting up computers know how many people aren't; writers know how many people need help writing.

The reason so many people have trouble writing is that it's fundamentally difficult. To write well you have to think clearly, and thinking clearly is hard.

And yet writing pervades many jobs, and the more prestigious the job, the more writing it tends to require.

These two powerful opposing forces, the pervasive expectation of writing and the irreducible difficulty of doing it, create enormous pressure. This is why eminent professors often turn out to have resorted to plagiarism. The most striking thing to me about these cases is the pettiness of the thefts. The stuff they steal is usually the most mundane boilerplate — the sort of thing that anyone who was even halfway decent at writing could turn out with no effort at all. Which means they're not even halfway decent at writing.

Till recently there was no convenient escape valve for the pressure created by these opposing forces. You could pay someone to write for you, like JFK, or plagiarize, like MLK, but if you couldn't buy or steal words, you had to write them yourself. And as a result nearly everyone who was expected to write had to learn how.

Not anymore. AI has blown this world open. Almost all pressure to write has dissipated. You can have AI do it for you, both in school and at work.

The result will be a world divided into writes and write-nots. There will still be some people who can write. Some of us like it. But the middle ground between those who are good at writing and those who can't write at all will disappear. Instead of good writers, ok writers, and people who can't write, there will just be good writers and people who can't write.

Is that so bad? Isn't it common for skills to disappear when technology makes them obsolete? There aren't many blacksmiths left, and it doesn't seem to be a problem.

Yes, it's bad. The reason is something I mentioned earlier: writing is thinking. In fact there's a kind of thinking that can only be done by writing. You can't make this point better than Leslie Lamport did: If you're thinking without writing, you only think you're thinking. So a world divided into writes and write-nots is more dangerous than it sounds. It will be a world of thinks and think-nots. I know which half I want to be in, and I bet you do too.

This situation is not unprecedented. In preindustrial times most people's jobs made them strong. Now if you want to be strong, you work out. So there are still strong people, but only those who choose to be.

It will be the same with writing. There wi
OK substack_waxman newsletter 1.00 🔗
Overall
1.00
Intrusion
0.00
Anchor
0.00
Must-not
0.00
Chars: 178,788 raw → 11,273 extracted → 10,618 cleaned (6% of raw)
Fetch
2ms
Extract
58ms
Clean
9,687ms
Text preview (extracted vs cleaned)

Extracted (11,273 chars)

**Published:** 2026-02-16T20:31:59+00:00

# Money at Machine Speed

[![David Waxman's avatar](https://substackcdn.com/image/fetch/$s_!IHx9!,w_36,h_36,c_fill,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5c133752-2c28-4870-86c6-4a830f67f9cd_4284x4284.jpeg)](https://substack.com/@waxmand)

[David Waxman](https://substack.com/@waxmand)

Feb 16, 2026

6

3

1

Share

---

Last week, Coinbase launched the first crypto wallet infrastructure built specifically for AI agents.¹ Hours later, Stripe announced it was adopting the same underlying protocol.² Two of the most important payments companies in the world shipped agent payment products on the same day.

Thanks for reading! Subscribe for free to receive new posts and support my work.

Subscribe

We’ve spent the last few months researching this space at TenOneTen Ventures, and we think most people are underestimating how fast it’s moving.

## Agents Have a Money Problem

We’ve gotten good at building AI agents that can reason, plan, and coordinate with each other. Impressively good. But there’s a comically basic thing most agents can’t do: pay for stuff.

An agent finds the API it needs but can’t swipe a credit card. It negotiates with another agent but has no way to settle. It identifies cheaper cloud compute but can’t buy it. Every time money enters the picture, the agent stops and waits for a human.

This is a little like building a fleet of self-driving trucks and then requiring a person to hand cash to every toll booth operator. The autonomy is real until the transaction isn’t.

McKinsey projects $1 trillion in US retail agentic commerce by 2030, with global projections reaching $3–5 trillion.³ ChatGPT already handles over 50 million daily shopping queries.⁴ The demand is there. The plumbing to actually move money between machines is what’s been missing.

Until, apparently, last week.

## What Agents Actually Need to Buy

Before we get to who’s building what, it’s worth stopping on a point that most coverage of this space glosses over: the things agents need to pay for mostly don’t look like anything humans buy.

Some of it is familiar. An agent booking a flight or ordering supplies — that’s just e-commerce with a different buyer. But the bigger and more interesting category is transactions that have no human equivalent at all.

An agent needs a real-time pricing lookup from another agent’s database. That’s a fraction of a cent, settled instantly. It needs ten seconds of GPU time to run an inference. It needs to check a credit score, pull weather data for a logistics route, or access a proprietary dataset to answer a question. Each of these is a microtransaction — maybe $0.001, maybe $0.01 — happening thousands of times per minute across millions of agents.

There’s no shopping cart here. No checkout flow. No “add to cart.” It’s closer to a stock exchange than a store. Agents are buying compute, data, and capabilities from each other

Cleaned (10,618 chars)

**Published:** 2026-02-16T20:31:59+00:00

# Money at Machine Speed

David Waxman

Feb 16, 2026

Last week, Coinbase launched the first crypto wallet infrastructure built specifically for AI agents.¹ Hours later, Stripe announced it was adopting the same underlying protocol.² Two of the most important payments companies in the world shipped agent payment products on the same day.

We’ve spent the last few months researching this space at TenOneTen Ventures, and we think most people are underestimating how fast it’s moving.

## Agents Have a Money Problem

We’ve gotten good at building AI agents that can reason, plan, and coordinate with each other. Impressively good. But there’s a comically basic thing most agents can’t do: pay for stuff.

An agent finds the API it needs but can’t swipe a credit card. It negotiates with another agent but has no way to settle. It identifies cheaper cloud compute but can’t buy it. Every time money enters the picture, the agent stops and waits for a human.

This is a little like building a fleet of self-driving trucks and then requiring a person to hand cash to every toll booth operator. The autonomy is real until the transaction isn’t.

McKinsey projects $1 trillion in US retail agentic commerce by 2030, with global projections reaching $3–5 trillion.³ ChatGPT already handles over 50 million daily shopping queries.⁴ The demand is there. The plumbing to actually move money between machines is what’s been missing.

Until, apparently, last week.

## What Agents Actually Need to Buy

Before we get to who’s building what, it’s worth stopping on a point that most coverage of this space glosses over: the things agents need to pay for mostly don’t look like anything humans buy.

Some of it is familiar. An agent booking a flight or ordering supplies — that’s just e-commerce with a different buyer. But the bigger and more interesting category is transactions that have no human equivalent at all.

An agent needs a real-time pricing lookup from another agent’s database. That’s a fraction of a cent, settled instantly. It needs ten seconds of GPU time to run an inference. It needs to check a credit score, pull weather data for a logistics route, or access a proprietary dataset to answer a question. Each of these is a microtransaction — maybe $0.001, maybe $0.01 — happening thousands of times per minute across millions of agents.

There’s no shopping cart here. No checkout flow. No “add to cart.” It’s closer to a stock exchange than a store. Agents are buying compute, data, and capabilities from each other in real time, programmatically, at a scale and speed that human-designed payment systems were never built to handle.

This is also why — and I’ll admit my partners had to convince me of this — crypto ends up being necessary. My initial instinct was that traditional rails would be fine. But think about it: Visa’s minimum transaction fees would cost more than the thing being purchased. You can’t run a $0.001 payment through a syst