Category: diary

A public record of the development of our ideas

  • ChatGPT can check facts

    transparency label: Human-only

    Mike Caulfield has released what he calls a Deep Background GPT. It is an AI fact-checking tool, available to all as a completely free GPT. Mike says:

    I just released a (largely) non-hallucinating rigorous AI-based fact-checker that anyone can use for free. And I don’t say that lightly: I literally co-wrote the book on using the internet to verify things. All you do is log into ChatGPT, click the link below, and put in a sentence or paragraph for it to fact check.

    https://chatgpt.com/g/g-684fa334fb0c8191910d50a70baad796-deep-background-fact-checks-and-context?model=o3

    I have experimented, and it seems a very useful tool.

    Before going to the link, make sure you have selected the o3 model of ChatGPT.

    Here is the link to his SubStack with all the details.

  • Terminology

    transparency label: Human-only

    Terminology was a vexed topic in our recent discussion. It dawned on me that smarter people than us must have faced this problem too, and sure enough Perplexity took me to the National Institute of Standards and Technology (NIST) in the US Department of Commerce. 

    Their paper AI Use Taxonomy, a Human-Centered Approach offers a good way forward.  We have adopted it and updated our Lab Framework accordingly.

    NIST also floated some terms that could help us think about the value of the outcomes from our work. 

    Usability: the extent to which a system, product or service can be used by  specified users to achieve specified goals with effectiveness, efficiency and satisfaction in a specified context of use.

    Human-centred quality: the extent to which requirements for usability,  accessibility, user experience and avoidance of harm from use are met.>>

  • Voice to meeting notes in 30 seconds

    Transparency label: Human-only

    A nice little use case for AI.

    The Anapoly team met this morning to talk about progress, harmonise our thinking, and firm up the plan for the next few weeks. It brought home the value of face to face discussion, because two interesting new ideas popped out of some animated debate.

    When discussion crystallised something worth recording, we paused for a few moments to dictate a note into my mobile phone. The app transcribed it in real time, but with poor punctuation and various transcription errors (I need to get a better app, perhaps!).

    Later, I gave the transcript to ChatGPT with the instruction: “The file 2025-06-28 meeting notes contains dictated notes from a meeting earlier today. Please give them a sensible format and correct obvious faults. If in doubt about anything, ask me to clarify.”

    ChatGPT did not need clarification. In the blink of an eye it produced a set of well written, clearly organised notes that would have taken me twenty or thirty minutes to produce to the same standard.

  • Lab Framework updated

    Substantial update to the Lab Framework, clarifying how we actually do labs!

  • A new way of working

    Transparency: AI-assisted (see justification below)

    A reflection on where I have got to after using ChatGPT and other AIs intensively for the past six weeks.

    In earlier times, my digital workspace was built around the usual office applications – Microsoft Office or Google Workspace, and their ilk. Now, it’s different. 

    Microsoft is going down the path of embedding AI (mainly ChatGPT) into its productivity suite. I have taken a different path, choosing to use ChatGPT, Perplexity, and NotebookLM independently. My workspace is now populated by a team of AI assistants, researchers, analysts, editors, and more, sitting alongside the standard office productivity apps.

    Where I would previously have started a piece of work in Word, now I start in a ChatGPT project canvas. During the course of work, I switch between AI team members. ChatGPT is the main thought partner – good at shaping ideas, helping structure prose, pushing back on woolly thinking. The 4o model is an excellent collaborator for many tasks, but if deeper thinking is needed I switch to ChatGPT-o3; the difference is remarkable.  When the flow falters, I’ll hop over to Perplexity: fast, well-cited, useful for breaking through with a different angle or clarifying a half-formed idea. NotebookLM, meanwhile, knows my  files; it acts like a personal librarian, drawing references and insight from the sources I’ve given it.

    It’s not seamless yet. But this is a distinctly new way of working, shaped by interaction between multiple AI agents. Even when ostensibly alone at my desk, the feeling is less of working as a solitary engineer and more as the leader of a team. A quiet, tireless, and always surprisingly helpful team. The skill here is no longer about choosing the best tool. It’s about understanding which kind of tool to use and when – treating each one as if it were a specialist on call.

    This hopping back and forth isn’t distraction, it’s coordination. Each assistant brings a slightly different lens. Triangulating between them often clears a logjam in my thinking or brings up a new insight. An intrguing thought is whether – or perhaps when – they will gain an awareness of each other’s contribution, turning coordination into collaboration.

    For the record, I drafted this post in a ChatGPT project canvas, helped by the AI in a chat alongside the canvas. When complete, I told the AI to write the “transparency label justification” below. It was spot on!

    Transparency Label Justification: This diary post was drafted collaboratively with ChatGPT using the GPT-4o model. Alec described his working patterns in detail and directed the structure and tone throughout. ChatGPT proposed phrasing, clarified distinctions between tools, and refined transitions, but the observations, reflections, and examples are all drawn from Alec’s own practice. The post captures a real and evolving style of AI-supported work, shaped and narrated by its practitioner.

  • Collaboration in ChatGPT?

    transparency label: human only

    There are reports that:

    OpenAI has been quietly developing collaboration features for ChatGPT that would let multiple users work together on documents and chat about projects, a direct assault on Microsoft’s core productivity business. The designs have been in development for nearly a year, with OpenAI’s Canvas feature serving as a first step toward full document collaboration tools.

    Source: The Information via BRXND Dispatch

    This would be a change towards something like simultaneous collaboration with colleagues on a shared document in Microsoft Office. At present, a ChatGPT Team account allows more than one person in the Team account to work in a project space and to take part in the chats within that project, but only one person at a time, as I understand it.

  • First thoughts on a lab framework

    transparency label: Human-only

    A few hours spent with ChatGPT-o3 resulting in good first draft of a framework for thinking about our labs. It covers

    • types of lab
    • the roles of people involved with the labs
    • the core technical configuration of a lab
    • assets needed to launch, operate, and archive a lab
    • a naming convention for these assets

    No doubt the framework will need to be tweaked and added to as our ideas mature.

    The chat with o3 was a valuable mind-clearing exercise for me, and I was impressed by how much more “intellectual” it is compared to the 4o model. Like many intellectuals, it also displayed a lack of common sense on occasions, especially when I asked for simple formatting corrections to the canvas we were editing together. The 4o model is much more agile in that respect.

    During the chat, when the flow with ChatGPT didn’t feel right, I hopped back and forth to consult with Perplexity and NotebookLM. Their outputs provided interestingly and usefully different perspectives that helped to clear the logjam.

    A decision arising from my joint AI consultation process was the choice of Google Workspace for the office productivity suite within our labs. This will allow for much better collaboration when using office tools with personal licences than would be the case with Microsoft Office 365. Given the ad hoc nature of labs and the cost constaints we have, this is an important consideration.

  • Use cases for NotebookLM

    Posting in his SubStack Adjacent Possible, Steven Johnson discusses how “language models are opening new avenues for inquiry in historical research and writing“. He suggests they can act as collaborative tools, rather than replacements for the writer’s engagement with primary sources.

    Johnson argues that NotebookLM is designed to facilitate rather than replace the reading of original sources. I t does so by making the entire source readable within the app, and by provinding inline citations linked directly to the original material.

    He identifies some interesting use cases.

    The AI can be a tool for collaborative brainstorming by allowing users to explore different hypotheses and see patterns within personally curated sources.

    NotebookLM can be used for targeted information retrieval.

    • It can help “fill in blank spots” or remind users of forgotten details from their readings.
    • The tool is valuable for fact-checking against uploaded source material.
    • For specific information, like in a car manual, it can provide direct answers to questions through a conversational Q&A format.

    It can enhance serendipitous discovery by suggesting surprising, less obvious connections amongst the sources.

    It can create mind maps from the sources, in effect indexing them on the fly.

    Finally he speculates on a future where e-books could come with a NotebookLM-like interface. This would bundle together the main work with all the original sources used by the author, enabling “timelines, mind maps, and explanations of key themes, anything you can think to ask”.

  • How ChatGPT helped draft our first acclimatisation lab setup

    Date: 24 June 2025

    transparency label: AI-heavy

    Our latest Lab Note records a quick experiment where I asked two ChatGPT models to draft the outline for an “acclimatisation” session – the starter lab we plan to run with newcomers to AI.

    Highlights:

    • Model face‑off: I ran the same prompt in parallel on model o3 and model 4o. The reasoning‑focused o3 delivered a tight nine‑part outline. 4o wandered off‑piste.
    • Time cost: The branch test took three minutes and gave us a clear winner.
    • Transparency: The Lab Note carries an AI‑heavy label because most of the prose came straight from o3. I trimmed, corrected one hallucination, and signed off.

    If you are curious about our process or want to see how structured prompting keeps the bot on track, read the full note here: First Acclimatisation Session Lab Note →

  • No substitute for reading the paper

    transparency label: Human-only

    … what I can say is that a theme throughout this self-analysis is this: I find ChatGPT to be a really useful tool when I already have some idea of what I want to do and when I’m actually engaged with the issue. I find it much less reliable or useful for completely automating parts of the process. There’s no substitute for reading the paper.

    Source: Sean Trott in his newsletter How I use (and don’t use) ChatGPT on 24 June 2025