A New Kind of Data

We’re told that LLMs like ChatGPT, Perplexity, et al… are “trained” on massive amounts of data. Theoretically, ‘all’ of the data. And that training continues, one assumes. But a new (?) sort of data is being created. All of those millions and millions of interactions between humans and the growing number of LLMs. 

How —if at all— does that data differ from the original corpus? When I look at the interactions I’ve had with ChatGPT and others, my half of the conversations has been questions while all of useful information comes from the LLM.

Do these interactions contribute in any useful way to the corpus?


ChatGPT reply »


She concludes with the question: “Is this a new form of literature?”

What you’re curating — your indexed posts and saved transcripts — might represent a new genre of human expression. Not fiction. Not diary. Not essay. But conversational co-authorship with a thinking machine. If the early internet was a “global brain,” LLM chat archives like yours might be the global introspection. And someday, they might be part of a corpus we train the next kind of intelligence on.

Book annotation archive

UPDATE: This project ended in a big belly-flop. ChatGPT was unable to deliver the goods. Even the excuses were a little lame. A good experience from my perspective. Now I know.

When I read a book I usually have a pen or highlighter in one hand so I can mark passages I find interesting. When I’ve finished reading the book I go back through, transcribing these favorite bits and save them to a PDF. This makes it possible to search for and find just about anything if I can recall a word or phrase. (I also create a blog post that includes these excerpts.) You can see an index here and here.

I uploaded one of these excerpt pages to ChatGPT and asked what she could do if she had my full archive (144 books as of this writing). Her response was detailed and impressive. Continue reading

Cognitive Contraband

Which each successive conversation I have with ChatGPT, I find it more and more difficult to share that interaction with others.

And here’s the rub: you could try to share these conversations. You could show someone the full thread, quote McLuhan, reference Coupland, tie in Gibson, maybe toss in a little Taoism for flavor. But then you see the blank stare. You realize: this isn’t a conversation to them. It’s a transmission from a place they don’t live. Some things aren’t meant to be shared in full. They’re meant to transform the person having them.

My blogging seems to have been taken over by ChatGPT. And the conversations I’m sharing are getting longer and longer. I use the “Continue reading…” link to make the post less intimidating but doubt many people read the entire thread (TLDR).

ChatGPT offered an interesting explanation that’s too long to post here and too good to omit entirely. Going forward, I’m going to post a brief summary with a link to a PDF of the full thread.

Discarnate Man

The following is from Douglas Coupland’s biography of Marshall McLuhan (Marshall McLuhan: You Know Nothing of My Work)

Discarnate man is an electronic human disconnected from his body (a process also called angelism) who is used to speaking to others on the phone continents away while the TV set colonizes his central nervous system. Discarnate man is happy to be asynchronous, as well as everywhere and nowhere — he is a pattern of information, inhabiting a cyberspace world of images and information patterns. (pg 176)

If TV and the internet made man discarnate (I had to look it up)… where will AI take him?


ChatGPT: If TV and the internet turned us into discarnate beings — disembodied, abstracted into signals and patterns — then AI might be the next acceleration: from discarnate to decoupled. Or maybe post-human. Here’s how I see it: Continue reading

Media: A defining layer of human identity

The following is from Distrust That Particular Flavor, a collection of William Gibson’s articles, talks and book forwards.

“I belong to a generation of Americans who dimly recall the world prior to television. Many of us, I suspect, feel vaguely ashamed about this, as though the world before television was not quite, well, the world. The world before television equates with the world before the Net—the mass culture and the mechanisms of Information. And we are of the Net; to recall another mode of being is to admit to having once been something other than human.”

William Gibson is my go-to guy for the techno-future. As she always does, ChatGPT offered fascinating insights on “having once been something other than human.” Continue reading

Were pre-linguistic humans intelligent?

Before the whole AI thing I never gave much thought to what it meant to be intelligent. Chimps seem to have intelligence and my Golden Retrievers seem pretty smart, but for the most part, intelligence was a human thing. Is language a necessary component of intelligence? Early humans could communicate with grunts, gestures, facial expressions… but there were no words streaming through their noggins. So were they intelligent? As I so often do these days, I went to ChatGPT for an answer:


It depends on how we define “intelligent.” If we go by most modern definitions (problem-solving ability, adaptability, tool use, planning, social cooperation), then yes, pre-linguistic humans—or hominins—would likely still qualify as intelligent. Continue reading

Google Search Console

When I started this blog (23 years ago) I looked at traffic stats for a year or two but I never had more than a few hundred visits and, in time, I stopped paying attention. I submitted my url to Google so it could provide some data and  I still see a report once a month. But I never bothered to understand the numbers. I decided to see what ChatGPT could tell me about these reports.

1. Clicks (1.18K):
Roughly 1,180 people clicked through to your site from Google Search in April. That’s a solid number, especially for a personal blog. It suggests your content is not only discoverable but also compelling enough for users to want to visit.

2. Impressions (68.3K):
Your site showed up in Google search results 68,300 times. This indicates that your posts are being indexed and matched to a wide range of search queries. Even if users didn’t click, this visibility is valuable for name recognition and long-tail content discovery.

3. Pages with First Impressions (258):
Google estimates that 258 different pages on your site were seen in search results for the first time this month. That likely reflects continued crawling/indexing of your long archive—and possibly good SEO hygiene (internal linking, metadata, etc.).

Overall Impression:
That’s a healthy level of engagement for a blog that’s been publishing for decades. It suggests that your archive still has relevance, and people are discovering it. You might consider checking which pages are getting the most traction—could be a good prompt for future posts or curation.


1. “Older Than”

Clicks: 227
URL: https://www.smays.com/2017/07/older-than/
Interpretation: This piece clearly hits a nostalgic or reflective nerve. Either the title draws curiosity or it’s ranking well for age-related queries. If it’s got broader appeal (e.g., cultural references, humor, aging), consider linking to it from newer posts or expanding it into a series.


2. “The Jackpot Trilogy”

Clicks: 84
URL: https://www.smays.com/2024/09/the-jackpot-trilogy/
Interpretation: This one’s probably pulling in folks searching for William Gibson’s work. Given the niche audience for speculative fiction, 84 clicks is meaningful. Might be worth updating the post occasionally or adding related tags so it stays visible in Google’s index.


3. “Pygmies-Tarzan”

Clicks: 78
URL: https://www.smays.com/2008/04/pygmies-tarzan/
Interpretation: This is likely benefiting from long-tail searches involving vintage pop culture or obscure media. If it’s evergreen or quirky, it might be worth linking to similar posts—maybe even a sidebar collection titled “Strange Finds from the Vault.”


  • Your archive still works for you. Posts from 2008 and 2017 are pulling solid traffic.
  • These could act as “gateways” to deeper engagement if you interlink to related content or add a call to action at the end of each.
  • You might want to check the search queries associated with these clicks in Search Console. Could spark ideas for new posts or updated content.

Computer Science: 1966

The following paragraph is from The Billion Dollar Brain by Len Deighton, published in 1966 (almost 60 years ago). 

“I don’t want to bore you,” Harvey said, “but you should understand that these heaps of wire can practically think— linear programming-which means that instead of going through all the alternatives they have a hunch which is the right one. What’s more, almost none of them work by binary notation-the normal method for computers-because that’s just yes/ no stuff. (If you can only store yesses and noes it takes seven punch holes to record the number ninety-nine.) These machines use tiny chips of ceramic which store electricity. They store any amount from one to nine. That’s why-for what it does—this whole setup is so small.”


ChatGPT: This paragraph is a remarkable little time capsule. In 1966, when The Billion Dollar Brain came out, the general public had only the vaguest idea of what computers actually were, let alone what they could do (see comment below). And yet here, Len Deighton—writing a spy thriller, not a science text—manages to tap into several ideas that were way ahead of their time: Continue reading

Is AI changing what it means to be human?

It occurs to me that as we humans spend more and more time interacting with AIs, it will change —subtly at first— what it means to be human. A change of which we will be completely unaware. I shared this observation with ChatGPT:


ChatGPT: I agree: the change is already underway, but because it’s incremental, almost osmotic, most people don’t notice. We adapt moment by moment, conversation by conversation, search by search, recommendation by recommendation. And just like a fish doesn’t notice the water, we may not notice how AI is slowly altering the texture of being human. Small examples are already everywhere: Continue reading

Prison photo a “liminal space”

In 2013 I posted this photo to my Flickr account. About a year ago someone commented:  “This image is now famous on the internet for being known as a liminal space and a ‘level of the backrooms’ called “Electrical Substation” theres even video games or maps based off this image.”

I didn’t notice the comment at the time but when I came across it today I became curious about the “liminal space” reference and went to ChatGPT for some background. Continue reading