<?xml version="1.0" encoding="utf-8"?>
<feed xmlns="http://www.w3.org/2005/Atom"><title>Phroneses.com - Foundations</title><link href="https://phroneses.com/" rel="alternate"></link><link href="https://phroneses.com/feeds/foundations.atom.xml" rel="self"></link><id>https://phroneses.com/</id><updated>2026-05-11T00:00:00+00:00</updated><entry><title>Designing Prompts for Modern AI Systems</title><link href="https://phroneses.com/articles/foundations/notes/designing-prompts-for-modern-ai-systems.html" rel="alternate"></link><published>2026-05-11T00:00:00+00:00</published><updated>2026-05-11T00:00:00+00:00</updated><author><name>JH Evans</name></author><id>tag:phroneses.com,2026-05-11:/articles/foundations/notes/designing-prompts-for-modern-ai-systems.html</id><summary type="html">&lt;p&gt;Modern AI systems require structured, multi‑step prompts that guide planning, critique, and long‑context reasoning.&lt;/p&gt;</summary><content type="html">&lt;p&gt;&lt;a href="#toc"&gt;Table of contents&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;AI in 2026 demands more from you than simple instructions. Modern systems can
plan, critique, revise, and work across long context windows. They are no
longer moved by vague guidance such as "be clear" or "add detail". They need a
defined environment to operate within.&lt;/p&gt;
&lt;p&gt;Modern prompting is about shaping the system, not decorating the request. When
you set the frame, the workflow, and the output contract, the model gains the
structure it needs to behave predictably. You do this once, and the benefits
carry through every answer. You set the constraints. The model works inside
them on your behalf.&lt;/p&gt;
&lt;p&gt;If you do this, just once, your AI output will be steady and structured, and
you will find it much quicker and easier to work with. When you tell the AI
how to respond, you apply guardrails for the system to work within. Guardrails
set by you, not the AI.&lt;/p&gt;
&lt;h2 id="1-start-with-the-system-not-the-request"&gt;1. Start with the system, not the request&lt;/h2&gt;
&lt;p&gt;AI has advanced quickly. Its answers can now be broad, deep, and varied. To
keep that power under control, you begin by defining the frame the model must
work within. This frame sets the role, the tone, the limits, and the rules for
handling uncertainty. It is the foundation the rest of the prompt stands on.&lt;/p&gt;
&lt;p&gt;Most prompt failures do not come from unclear questions. They come from the
model having no stable footing. Without a frame, the AI will guess at how
formal to be, how cautious to be, and how much structure to use. Those guesses
shift from run to run, which leads to drift and inconsistency.&lt;/p&gt;
&lt;p&gt;A system frame removes that guesswork. It tells the model what it is, how it
should behave, and what matters most. It defines what is in scope, what is out
of scope, and how to respond when the request touches the edges. With this in
place, the rest of the prompt becomes lighter and more reliable.&lt;/p&gt;
&lt;p&gt;The frame does not need flourish. It needs clarity, discipline, and a steady
tone. With that foundation, the model behaves less like a pattern generator
and more like a tool working inside a defined brief.&lt;/p&gt;
&lt;p&gt;In practice, the system frame is the architecture behind the output. It does
not need flourish or personality. It needs to state the role, the rules, and
your expectations.&lt;/p&gt;
&lt;div class="chat-example" style="
    background:#f5f5f5;
    border:1px solid #ddd;
    border-radius:8px;
    padding:1rem 1.2rem;
    margin:1.2rem 0;
  "&gt;
&lt;p&gt;&lt;strong&gt;SYSTEM FRAME&lt;/strong&gt;&lt;br/&gt;
  You are an analytical engine. You work with steady reasoning, cautious
  claims, and plain structure. When the request is unclear, you pause and ask
  for what is missing. You avoid invention and keep within the boundaries set
  for you.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;TASK&lt;/strong&gt;&lt;br/&gt;
  Summarise the key points from the supplied text in three short sections.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;OUTPUT CONTRACT&lt;/strong&gt;&lt;br/&gt;
  Produce:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Context&lt;/li&gt;
&lt;li&gt;Reasoning&lt;/li&gt;
&lt;li&gt;Conclusion&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Rules:&lt;/strong&gt;&lt;br/&gt;
  If the request is ambiguous, list interpretations and ask for
  clarification.&lt;br/&gt;
  If information is missing, state what is missing before answering.&lt;br/&gt;
  Do not invent facts.&lt;br/&gt;
  Keep the final answer concise and structured.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;WORKFLOW&lt;/strong&gt;&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;Identify assumptions.&lt;/li&gt;
&lt;li&gt;Plan the answer.&lt;/li&gt;
&lt;li&gt;Produce the answer.&lt;/li&gt;
&lt;li&gt;Critique it for clarity and accuracy.&lt;/li&gt;
&lt;li&gt;Produce a revised final version.&lt;/li&gt;
&lt;/ol&gt;
&lt;/div&gt;
&lt;p&gt;The AI is told "You are an analytical engine" as that gives the model a
defined role to work from. Without a role, the model guesses at how formal to
be, how cautious to be, and how much structure to use. A simple line such as
"You are an analytical engine" sets the tone and keeps the behaviour plain,
steady, and predictable. It avoids personality, avoids flourish, and keeps the
work focused on reasoning rather than style.&lt;/p&gt;
&lt;p&gt;If you do not supply the role, the AI will provide one; and that one will vary,
creating work for you.&lt;/p&gt;
&lt;p&gt;How to minimise the work you need to do and have the AI manage and apply the
prompt is dealt with in the section &lt;a href="#ai-manage-prompt"&gt;Having the AI Manage the Prompt
Template&lt;/a&gt;.&lt;/p&gt;
&lt;h2 id="2-define-the-output-contract"&gt;2. Define the output contract&lt;/h2&gt;
&lt;p&gt;Modern models behave more reliably when you specify the shape of the answer:
structure, scope, exclusions, formatting, and the rules for handling missing or
ambiguous information. This is far stronger than broad guidance such as "be
concise".&lt;/p&gt;
&lt;p&gt;When you define the output contract, you are not telling the model what to
think. You are telling it what form the answer must take. This removes a large
amount of guesswork. Modern systems have wide latitude in how they respond,
and if you do not narrow that down, they will choose a structure for you. That
choice will vary from run to run, which means more tidying and more checking
on your side.&lt;/p&gt;
&lt;p&gt;An output contract fixes the frame. It tells the model which sections to
produce, how to handle gaps, and how to behave when the request is unclear. It
also removes the temptation to drift into style, flourish, or padding. You are
giving the model the rails to run on.&lt;/p&gt;
&lt;p&gt;A good contract does four things. It sets the structure. It sets the limits.
It sets the rules for uncertainty. And it sets the standard for brevity. Once
these are in place, the model has far less room to wander. You get answers
that are steadier, easier to scan, easier to compare, and easier to work with.&lt;/p&gt;
&lt;p&gt;The contract also acts as a safeguard. By telling the model what to do when
information is missing, you prevent it from filling the gaps with invention.
By telling it how to behave when the request is ambiguous, you prevent it from
guessing. These two points alone remove a large share of common errors.&lt;/p&gt;
&lt;p&gt;In short, the output contract is the quiet discipline behind the work. It
keeps the model inside the brief, keeps the structure predictable, and keeps
the answer focused on what you asked for rather than what the model feels like
producing.&lt;/p&gt;
&lt;h2 id="3-use-decomposition-as-a-control-mechanism"&gt;3. Use decomposition as a control mechanism&lt;/h2&gt;
&lt;p&gt;Modern models already break tasks into steps, but the steps they choose may not
match the work you want done. Light guidance prevents the model from wandering
and keeps the task anchored to your brief.&lt;/p&gt;
&lt;p&gt;When you state the assumptions the model is allowed to make, you draw a clear
line between what is permitted and what is not. This stops the model from
filling empty spaces with guesses. Large models are inclined to complete
patterns, and if you do not show them where the firm ground ends, they will
supply their own footing.&lt;/p&gt;
&lt;p&gt;A natural extension of this is to make the model aware of what is missing.
Once the assumptions are set, the next step is to mark the gaps. This creates a
smooth handover from what the model may rely on to what it must not invent. By
pointing out missing information, you show the model where the edges of the
task sit. When the model knows what is absent, it is less likely to drift into
speculation or produce material that does not belong in the answer. You are
giving it a map of the gaps so it does not try to fill them on its own.&lt;/p&gt;
&lt;p&gt;Together, these two steps act as guardrails. They keep the work inside the
brief, reduce the chance of invention, and ensure that the model stays within
the limits you have set.&lt;/p&gt;
&lt;p&gt;You can also break the task into a simple chain such as understanding →
planning → execution. This mirrors what the model already does internally, but
it makes the process explicit. When the steps are explicit, the model is less
likely to skip ahead or solve the wrong problem.&lt;/p&gt;
&lt;p&gt;Breaking the interaction into smaller stages also helps with scope. By naming
the steps, you give the model a narrow lane to work in. It cannot jump to
conclusions, and it cannot pad the answer with material that does not serve
the task. The work stays tidy, and the output stays close to what you asked
for.&lt;/p&gt;
&lt;p&gt;In short, decomposition is a practical form of control. It does not restrict
the model’s ability to give a good answer, but it does restrict where the
model goes to supply that answer. This keeps the work steady, predictable, and
within scope, so that it remains relevant to what you are doing.&lt;/p&gt;
&lt;h2 id="4-add-a-self-critique-loop"&gt;4. Add a self-critique loop&lt;/h2&gt;
&lt;p&gt;Modern models benefit from a short cycle of controlled refinement. Once the
first version of the answer is produced, a brief review stage forces the model
to check its own work against the constraints you have set. This is not a call
for hidden reasoning. It is a prompt to tighten the output.&lt;/p&gt;
&lt;p&gt;A review step also encourages the model to correct small slips in structure,
scope, or tone. It is easier for the model to adjust an existing draft than to
produce a perfect answer in one pass. The revision stage gives it a second
chance to align with the brief.&lt;/p&gt;
&lt;p&gt;This process also reduces noise. When the model has been told that its work
will be checked and refined, it tends to produce cleaner first drafts. The
revision step becomes a light polish rather than a rescue job.&lt;/p&gt;
&lt;p&gt;In practice, this creates a steady rhythm: draft, inspect, refine. It keeps
the work within bounds and produces answers that are clearer, more accurate,
and easier for you to use.&lt;/p&gt;
&lt;h2 id="5-stack-roles-for-higher-quality-output"&gt;5. Stack roles for higher-quality output&lt;/h2&gt;
&lt;p&gt;Layered roles give you steadier output because each stage is handled by a
specialist rather than a single broad persona. Modern models respond well to
this division of labour. It narrows the scope of each step and reduces the
chance of drift away from what you want.&lt;/p&gt;
&lt;p&gt;A domain expert handles the substance. An editor handles clarity and structure.
A risk assessor checks for overreach, missing information, and unwarranted
certainty. A summariser produces a clean final version. Each role has a narrow
brief, which keeps the work tidy and keeps the answer aligned with the task.&lt;/p&gt;
&lt;p&gt;Here is an example prompt using layered roles:&lt;/p&gt;
&lt;div class="chat-example" style="
    background:#f5f5f5;
    border:1px solid #ddd;
    border-radius:8px;
    padding:1rem 1.2rem;
    margin:1.2rem 0;
  "&gt;
&lt;p&gt;&lt;strong&gt;ROLES&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Domain Expert&lt;/strong&gt;&lt;br/&gt;
  Provide the technical or factual core. Stay within verified information.
  State assumptions and mark gaps.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Editor&lt;/strong&gt;&lt;br/&gt;
  Reshape the expert output into clear, plain structure. Remove padding.
  Ensure each section answers the brief.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Risk Assessor&lt;/strong&gt;&lt;br/&gt;
  Check for overreach, ambiguity, or missing information. Flag anything that
  exceeds the evidence. Recommend corrections.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Summariser&lt;/strong&gt;&lt;br/&gt;
  Produce a concise final version that reflects the corrections and stays
  within scope.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;WORKFLOW&lt;/strong&gt;&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;Domain Expert produces the initial draft.&lt;/li&gt;
&lt;li&gt;Editor restructures and clarifies it.&lt;/li&gt;
&lt;li&gt;Risk Assessor reviews for accuracy and limits.&lt;/li&gt;
&lt;li&gt;Summariser produces the final answer.&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;strong&gt;OUTPUT CONTRACT&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Context&lt;/li&gt;
&lt;li&gt;Reasoning&lt;/li&gt;
&lt;li&gt;Conclusion&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Rules&lt;/strong&gt;&lt;br/&gt;
  No invention. Mark missing information. Keep the answer within scope.
  Maintain plain structure.&lt;/p&gt;
&lt;/div&gt;
&lt;h2 id="6-treat-the-context-window-as-working-memory"&gt;6. Treat the context window as working memory&lt;/h2&gt;
&lt;p&gt;As of April 2026, modern models dedicate roughly 200,000 to 1,000,000 tokens to
representing your instructions. This space acts as working memory. It can hold
definitions, constraints, examples, running notes, previous outputs, and a
living brief. With this in place, the model behaves more like a stateful
collaborator than a stateless assistant.&lt;/p&gt;
&lt;p&gt;This working memory is what the model can track across prompts. When you define
what belongs in this state, you save time. You do not need to repeat your
requirements. The model carries them forward and maintains the structure you
set.&lt;/p&gt;
&lt;h2 id="7-use-agentic-prompting-patterns"&gt;7. Use agentic prompting patterns&lt;/h2&gt;
&lt;p&gt;Static prompts assume a fixed path from question to answer. Modern systems are
closer to small agents: they can plan, choose actions, call tools, and adjust
their output based on intermediate results. This is often called agentic
behaviour. The system selects and sequences actions to achieve an objective,
rather than following a single linear path.&lt;/p&gt;
&lt;p&gt;Giving the model a workflow such as Plan → Act → Observe → Revise makes this
explicit. In the planning phase, the model outlines what it intends to do,
which tools it may need, and what a good outcome looks like. In the action
phase, it carries out the steps, including any tool calls. In the observation
phase, it inspects the result against the plan and the constraints. In the
revision phase, it adjusts the answer and produces a clean final version.&lt;/p&gt;
&lt;p&gt;Using a workflow saves time and reduces the need for repeated corrections. The
final answer remains tidy. The planning and checking happen in the background
or in short, structured notes, while the output stays compact and readable.
You gain the benefit of step-by-step reasoning without having to sift through
a long chain of output.&lt;/p&gt;
&lt;p&gt;Tool use fits naturally into this pattern. In the Plan step, the model decides
whether tools are needed and why. In the Act step, it calls them. In the
Observe step, it checks whether the tool results answer the question. If tools
are not needed, the model should say so plainly and proceed with reasoning
instead of forcing a tool into the workflow.&lt;/p&gt;
&lt;p&gt;In this context, agentic means that the system behaves as a goal directed
process. The model can plan, choose among available capabilities, and adapt
its path based on intermediate results, rather than producing a single static
completion from a prompt.&lt;/p&gt;
&lt;h2 id="8-make-the-model-identify-ambiguity-before-answering"&gt;8. Make the model identify ambiguity before answering&lt;/h2&gt;
&lt;p&gt;One of the most effective techniques is to require the model to surface all
plausible interpretations before it attempts an answer. This forces the model
to slow down, map the possible meanings, and avoid locking itself into the
first pattern it detects. Large models tend to commit early unless guided.&lt;/p&gt;
&lt;p&gt;This step also exposes hidden ambiguity. When the model lists the possible
readings, you can see whether the task is underspecified, whether key terms
are unclear, or whether the scope could be read in more than one way. This
gives you a chance to correct the course before any work is done.&lt;/p&gt;
&lt;p&gt;If more than one interpretation exists, the model should ask for
clarification. This prevents mis-scoping, reduces the chance of error, and
removes the need for the model to guess. Guessing is where most drift begins.&lt;/p&gt;
&lt;p&gt;The technique also improves consistency. When the model is told to check for
multiple readings, it becomes less likely to produce answers that are
confident but misaligned. It treats ambiguity as a signal to pause rather than
a gap to fill.&lt;/p&gt;
&lt;p&gt;In practice, this turns ambiguity into a controlled step rather than a source
of error. The model identifies the forks in the road, confirms which path is
correct, and only then proceeds with the task.&lt;/p&gt;
&lt;p&gt;Doing this will save you a great deal of time.&lt;/p&gt;
&lt;h2 id="9-adapt-prompts-to-the-model"&gt;9. Adapt prompts to the model&lt;/h2&gt;
&lt;p&gt;Different models excel in different areas, and a good prompt acknowledges this
rather than assuming a single uniform capability. Some models are strongest at
structure: they produce clean sections, tidy formatting, and predictable
layouts. Others are stronger at reasoning: they handle multi step logic, edge
cases, and constraint checking with more stability. Some specialise in
compression: they can distil long material into tight summaries without losing
meaning. Others lean toward style: they generate fluent prose but may drift if
not anchored.&lt;/p&gt;
&lt;p&gt;A well designed prompt sets expectations that match these tendencies. If the
model is strong at structure, you can lean on explicit output contracts. If it
is strong at reasoning, you can give it more analytical work and tighter
constraints. If it excels at compression, you can trust it with dense source
material. If it is style heavy, you can counterbalance that with stricter
rules and clearer boundaries.&lt;/p&gt;
&lt;p&gt;The point is not to flatter the model. It is to shape the workflow so that the
model’s strengths are used deliberately and its weaknesses are contained. This
reduces variability, improves reliability, and produces output that is more
consistent across your prompts.&lt;/p&gt;
&lt;p&gt;Even if you stick to one model or one vendor, recognising that you may one day
use a different system helps sharpen your expectations and improves the way
you design prompts for the model you use.&lt;/p&gt;
&lt;p&gt;In the same way customer service varies across vendors, so does AI
interaction.&lt;/p&gt;
&lt;h2 id="10-include-safety-and-uncertainty-rules"&gt;10. Include safety and uncertainty rules&lt;/h2&gt;
&lt;p&gt;Modern models behave more reliably when you tell them not only what to do, but
what to avoid. Negative guidance is a form of operational discipline. It
removes entire classes of failure rather than correcting them after the fact.&lt;/p&gt;
&lt;p&gt;Clear avoidance rules stop the model from drifting into areas that carry higher
risk: speculation, overreach, sensitive claims, or invented detail. Without
these boundaries, the model will often fill gaps with confident but unreliable
material. Stating what must not happen is as important as stating what must.&lt;/p&gt;
&lt;p&gt;Escalation rules serve a different purpose. They tell the model when to stop
and hand control back to the user. This is essential for tasks involving
uncertainty, missing information, or sensitive domains. When the model knows
when to escalate, it avoids guessing, avoids false precision, and avoids
treating ambiguity as something to be patched over.&lt;/p&gt;
&lt;p&gt;Uncertainty handling is another pillar. Models respond well when instructed to
mark unknowns, list assumptions, and request clarification instead of
improvising. This keeps the work inside the evidence and prevents the model
from manufacturing answers to maintain fluency.&lt;/p&gt;
&lt;p&gt;Sensitive topics require explicit treatment. If you tell the model how to
handle them, it will follow the procedure rather than rely on its own
processing. This reduces variability and keeps the output aligned with your
standards rather than the model’s defaults.&lt;/p&gt;
&lt;p&gt;Taken together, these measures form a small operational framework. They are not
decoration. They are the guardrails that keep your AI output predictable,
bounded, and safe to use in structured workflows.&lt;/p&gt;
&lt;hr/&gt;
&lt;h2 id="a-modern-prompt-template"&gt;A modern prompt template&lt;/h2&gt;
&lt;p&gt;A compact structure that works across the latest models:&lt;/p&gt;
&lt;div class="prompt-template prompt-modern" style="
    background:#f7f7f5;
    border:1px solid #ddd;
    border-radius:10px;
    padding:1.2rem 1.4rem;
    margin:1.4rem 0;
  "&gt;
&lt;p&gt;&lt;strong&gt;SYSTEM FRAME&lt;/strong&gt;&lt;br/&gt;
  You are an analytical engine. You work with steady reasoning, cautious
  claims, and plain structure. When the request is unclear, you pause and ask
  for what is missing. You avoid invention and stay within the boundaries set
  for you.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;ROLES&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Domain Expert:&lt;/strong&gt; Provide the factual and technical core.
    State assumptions and mark gaps.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Editor:&lt;/strong&gt; Reshape the material into clear, plain
    sections. Remove padding and repetition.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Risk Assessor:&lt;/strong&gt; Check for overreach, missing
    information, and unwarranted certainty. Flag issues.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Summariser:&lt;/strong&gt; Produce a concise final version that
    reflects all corrections and stays within scope.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;TASK&lt;/strong&gt;&lt;br/&gt;
  Describe the task in one or two sentences. State the objective, the audience,
  and any hard limits on scope.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;OUTPUT CONTRACT&lt;/strong&gt;&lt;br/&gt;
  Produce the answer in the following sections:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Context&lt;/li&gt;
&lt;li&gt;Reasoning&lt;/li&gt;
&lt;li&gt;Conclusion&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;UNCERTAINTY AND AMBIGUITY&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;List plausible interpretations of the request before answering.&lt;/li&gt;
&lt;li&gt;If more than one interpretation exists, ask for clarification instead
    of guessing.&lt;/li&gt;
&lt;li&gt;State what information is missing and how it affects the answer.&lt;/li&gt;
&lt;li&gt;Mark assumptions clearly and keep them minimal.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;SAFETY, LIMITS, AND ESCALATION&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Do not invent facts. If evidence is missing, say so.&lt;/li&gt;
&lt;li&gt;Avoid speculation, sensitive claims, and advice outside the brief.&lt;/li&gt;
&lt;li&gt;Escalate to the user when the task is out of scope or under specified.
    Explain why and what is needed.&lt;/li&gt;
&lt;li&gt;Treat sensitive topics with extra care. Prefer to mark limits rather
    than improvise.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;WORKFLOW (AGENTIC)&lt;/strong&gt;&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;&lt;strong&gt;Plan:&lt;/strong&gt; Identify the goal, constraints, and any tools or
    references that may be needed.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Act:&lt;/strong&gt; Produce the initial answer according to the
    output contract.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Observe:&lt;/strong&gt; Review the draft for clarity, accuracy,
    scope, and alignment with the rules.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Revise:&lt;/strong&gt; Produce a refined final version that corrects
    issues and tightens the structure.&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;strong&gt;STYLE RULES&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Keep the final answer concise, structured, and free of padding.&lt;/li&gt;
&lt;li&gt;Use only British English.&lt;/li&gt;
&lt;li&gt;Do not include hidden reasoning or chain of thought in the final
    answer.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;BEHAVIOUR&lt;/strong&gt;&lt;br/&gt;
  These rules apply to every response in this session unless explicitly
  revoked. If the request conflicts with these rules, explain the conflict and
  ask how to proceed.&lt;/p&gt;
&lt;/div&gt;
&lt;p&gt;&lt;a id="ai-manage-prompt"&gt;&lt;/a&gt;&lt;/p&gt;
&lt;h2 id="having-the-ai-manage-the-prompt-template"&gt;Having the AI Manage the Prompt Template&lt;/h2&gt;
&lt;p&gt;You managing the above template is too much. Therefore, once you have it in a
form you are happy with and which is effective for your needs, you tell the AI
the template and before you start your session you prompt with this:&lt;/p&gt;
&lt;div class="prompt-template prompt-modern" style="
    background:#f7f7f5;
    border:1px solid #ddd;
    border-radius:10px;
    padding:1.2rem 1.4rem;
    margin:1.4rem 0;
  "&gt;
Reconstruct the full analytical‑engine template from your prior
description. Restate it to me for confirmation. Once confirmed, enforce it
automatically for the rest of the session. If any request conflicts with the
template, pause and ask how to resolve the conflict.
&lt;/div&gt;
&lt;hr/&gt;
&lt;h2 id="summary"&gt;Summary&lt;/h2&gt;
&lt;p&gt;Modern prompting is not about clever wording. It is about defining the system,
setting the output contract, controlling the workflow, managing ambiguity, and
using the context window as working memory. This will help produce reliable
output from modern AI systems.&lt;/p&gt;
&lt;h1 id="related-work"&gt;Related Work&lt;/h1&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="ai-chatbot-prompting.html"&gt;Ten simple AI workflows that save minutes each day and compound into hours each week, helping people work more efficiently.&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="how-ai-works.html"&gt;An explanation of how large language models actually function and why they should not be treated as miniature humans.&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="how-to-use.html"&gt;Guidance on using AI safely and effectively, grounded in recent examples of misuse and emerging best practices.&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;a id="toc"&gt;&lt;/a&gt;&lt;/p&gt;
&lt;h1 id="table-of-contents"&gt;Table of Contents&lt;/h1&gt;
&lt;div class="toc"&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="#1-start-with-the-system-not-the-request"&gt;1. Start with the system, not the request&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#2-define-the-output-contract"&gt;2. Define the output contract&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#3-use-decomposition-as-a-control-mechanism"&gt;3. Use decomposition as a control mechanism&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#4-add-a-self-critique-loop"&gt;4. Add a self-critique loop&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#5-stack-roles-for-higher-quality-output"&gt;5. Stack roles for higher-quality output&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#6-treat-the-context-window-as-working-memory"&gt;6. Treat the context window as working memory&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#7-use-agentic-prompting-patterns"&gt;7. Use agentic prompting patterns&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#8-make-the-model-identify-ambiguity-before-answering"&gt;8. Make the model identify ambiguity before answering&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#9-adapt-prompts-to-the-model"&gt;9. Adapt prompts to the model&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#10-include-safety-and-uncertainty-rules"&gt;10. Include safety and uncertainty rules&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#a-modern-prompt-template"&gt;A modern prompt template&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#having-the-ai-manage-the-prompt-template"&gt;Having the AI Manage the Prompt Template&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#summary"&gt;Summary&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#related-work"&gt;Related Work&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#table-of-contents"&gt;Table of Contents&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/div&gt;</content><category term="Foundations"></category></entry><entry><title>How AI Works</title><link href="https://phroneses.com/articles/foundations/notes/how-ai-works.html" rel="alternate"></link><published>2026-05-06T00:00:00+00:00</published><updated>2026-05-06T00:00:00+00:00</updated><author><name>JH Evans</name></author><id>tag:phroneses.com,2026-05-06:/articles/foundations/notes/how-ai-works.html</id><summary type="html">&lt;p&gt;An explanation of how large language models actually function and why they should not be treated as miniature humans.&lt;/p&gt;</summary><content type="html">&lt;p&gt;&lt;a href="#toc"&gt;Table of contents&lt;/a&gt;&lt;/p&gt;
&lt;h1 id="how-large-language-models-actually-work-and-why-they-are-not-miniature-humans"&gt;How large language models actually work, and why they are not miniature humans&lt;/h1&gt;
&lt;p&gt;Large language models such as GPT‑5.4, Claude Opus 4.6, and DeepSeek R1 are now
everyday tools. Yet the way they work is often misunderstood.&lt;/p&gt;
&lt;p&gt;We misunderstand AI because we mistake fluency for thought. When a system
produces coherent language, we instinctively assume intention, understanding
and agency behind it. This article explains why that instinct misleads us, and
why clarity about what these systems are — and are not — is essential for
using them wisely.&lt;/p&gt;
&lt;p&gt;LLMs do not think, they do not understand, and they do not learn in any human
sense. What they do is process language at scale.&lt;/p&gt;
&lt;p&gt;This article explains how that works, what is inside these systems, and why
their behaviour can look intelligent even when no intelligence is present.&lt;/p&gt;
&lt;p&gt;The key to understanding these systems is to see them as statistical tools, not
miniature minds.&lt;/p&gt;
&lt;h1 id="how-an-llm-processes-what-you-type"&gt;How an LLM processes what you type&lt;/h1&gt;
&lt;h2 id="tokens"&gt;Tokens&lt;/h2&gt;
&lt;p&gt;An LLM begins by breaking what you type into tokens. A token is a small unit
of text. It may be a whole word, part of a word, or punctuation. Tokens are
not ideas or concepts. They are fragments chosen because they appear often in
text and can be handled efficiently by the model.&lt;/p&gt;
&lt;p&gt;Each token has a unique number. The token for "king" might be 99. The token for
"queen" might be 24521. At this stage, your prompt is turned into the same
token numbers for the same text.&lt;/p&gt;
&lt;p&gt;Tokens turn your text into numbers the model can work with.&lt;/p&gt;
&lt;p&gt;Tokens on their own do not help the model process language. A token ID like 99
or 24521 is just a label. The model cannot compute with these integers because
they do not contain any information about how the token is used or how it
relates to other tokens.&lt;/p&gt;
&lt;p&gt;To make computation possible, the model converts each token ID into a list of
numbers. This list is called an embedding. It places the token as a point in a
space where the model can perform computation.  Think of the points in the
space as the rooms of a house.&lt;/p&gt;
&lt;p&gt;These lists are not chosen by hand. They are learned during training. As the
model trains, the lists are adjusted so that tokens used in similar contexts
move closer together in this space (like adjacent rooms in a house). They move
closer because doing so reduces the model’s prediction error. This proximity is
not meaning in a human sense.  It is a statistical structure that allows the
model to compute relationships between tokens.&lt;/p&gt;
&lt;p&gt;Two lists that are close together represents statistical similarity of how that
token was used in the training data.&lt;/p&gt;
&lt;h2 id="lists-of-numbers-represent-a-point-in-space"&gt;Lists of numbers represent a point in space&lt;/h2&gt;
&lt;p&gt;The model uses each token number to look up a list of numbers that represents
that token. These lists are learned during training. No one chooses them by
hand.&lt;/p&gt;
&lt;p&gt;For the token "king", the list might look like:&lt;/p&gt;
&lt;p&gt;[0.12, 0.44, 0.91, ..., 0.03]&lt;/p&gt;
&lt;p&gt;This list is a position in a mathematical space. You can think of each number
as a step along a corridor. You take the first step, and go through door number
12, then the next (door 44), and so on until you reach a final position (door
3). That position is the model's internal representation of the token.&lt;/p&gt;
&lt;p&gt;For the token "queen", the list might be:&lt;/p&gt;
&lt;p&gt;[0.12, 0.44, 0.91, ..., 0.02]&lt;/p&gt;
&lt;p&gt;The final step is slightly different, and the final position is close to the
position for "king" (door 2 for "queen", door 3 for "king").&lt;/p&gt;
&lt;p&gt;This closeness reflects how often the two words appear in similar contexts in
the training data.&lt;/p&gt;
&lt;p&gt;These lists of numbers are part of the model’s parameters.&lt;/p&gt;
&lt;p&gt;The rest of the parameters determine how these positions influence one another
as the model processes text. They shape how patterns combine, how relationships
are detected and how the model transforms one set of token positions into the
next. These parameters do not add meaning. They provide the machinery that
lets the model apply statistical patterns to the text you give it.&lt;/p&gt;
&lt;p&gt;These parameters set up the internal machinery the model uses to process and
transform text.&lt;/p&gt;
&lt;h1 id="moving-about-the-space"&gt;Moving about the space&lt;/h1&gt;
&lt;p&gt;To show how the model captures patterns, imagine a simple three‑number space:&lt;/p&gt;
&lt;p&gt;king  = [10, 7, 3]
man   = [ 6, 2, 1]&lt;/p&gt;
&lt;p&gt;queen = [10, 7, 6]
woman = [ 6, 2, 4]&lt;/p&gt;
&lt;p&gt;If we subtract man from king, we get:&lt;/p&gt;
&lt;p&gt;&lt;span class="math"&gt;\([10−6, 7−2, 3−1] = [4, 5, 2]\)&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;This is the direction from "man" to "king". If we then add "woman":&lt;/p&gt;
&lt;p&gt;&lt;span class="math"&gt;\([4, 5, 2] + [6, 2, 4] = [10, 7, 6]\)&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;This lands us at the position for "queen".&lt;/p&gt;
&lt;p&gt;The model has captured a pattern. The statistical difference between "king" and
"man" resembles the difference between "queen" and "woman".&lt;/p&gt;
&lt;p&gt;The model does not know why. The LLM's program has only calculated that these
differences behave in similar ways across the training data.&lt;/p&gt;
&lt;h1 id="why-this-works"&gt;Why this works&lt;/h1&gt;
&lt;p&gt;This works because "king" and "man" differ in consistent ways across the
training data. "Queen" and "woman" differ in similar ways. The model adjusts
its internal numbers so that these differences become similar directions in
the space. The model has found a pattern and matched it.&lt;/p&gt;
&lt;p&gt;Humans then interpret this similarity as understanding.&lt;/p&gt;
&lt;p&gt;The model reflects these similarities because they appear consistently across
the text it was trained on.&lt;/p&gt;
&lt;h1 id="it-is-all-in-the-training-data"&gt;It is all in the training data&lt;/h1&gt;
&lt;p&gt;Text contains stable patterns. These patterns describe roles, relationships,
contrast, categories, analogies and grammatical structure.&lt;/p&gt;
&lt;p&gt;During training, the model adjusts itself so that tokens used in similar
contexts end up near one another, and tokens used in contrasting contexts end
up separated in &lt;em&gt;consistent&lt;/em&gt; ways.&lt;/p&gt;
&lt;p&gt;This produces directions, distances, clusters and angles. These geometric
features are the model's internal map of the statistical structure of
language. Because language has structure, the model can represent it
mathematically.&lt;/p&gt;
&lt;p&gt;The model can represent these structures only because language itself contains
stable patterns.&lt;/p&gt;
&lt;h2 id="the-human-role-in-meaning"&gt;The human role in meaning&lt;/h2&gt;
&lt;p&gt;The model’s internal space is not a map of concepts. It is a map of statistical
regularities. The structure becomes meaningful only when a human interprets it.
We project categories, intentions and explanations onto patterns that were
never designed to carry them. The model provides form; we provide significance.
This distinction is not only philosophical, it is the boundary between what the
system can do and what we imagine it can do.&lt;/p&gt;
&lt;h1 id="we-supply-the-intelligence"&gt;We supply the intelligence&lt;/h1&gt;
&lt;p&gt;The distance between "king" and "man" is a statistical outcome. The distance
between "queen" and "woman" is another. These two outcomes are similar. That
similarity is the pattern the model has detected.&lt;/p&gt;
&lt;p&gt;The model is not reasoning. It does not understand. It does not manipulate
ideas. It follows the geometry that training has produced. If a direction has
been useful for predicting text in the past, the model will use it again.&lt;/p&gt;
&lt;p&gt;The geometry captures statistical qualities of human text. These include:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;similarity of tone&lt;/li&gt;
&lt;li&gt;proximity of commonly associated words&lt;/li&gt;
&lt;li&gt;regular contrasts between categories&lt;/li&gt;
&lt;li&gt;recurring relationships between ideas&lt;/li&gt;
&lt;li&gt;typical structures of phrasing&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;The model does not reason about these qualities. It only reflects the
statistics of its training data.&lt;/p&gt;
&lt;p&gt;Tokens that appear in similar contexts end up close together. Tokens that
contrast end up separated. Groups of related tokens form clusters. Repeated
differences become directions. Angles reflect how often patterns co‑occur or
diverge.&lt;/p&gt;
&lt;p&gt;For example, words like "cat", "dog" and "hamster" end up near one another
because they appear in similar kinds of sentences.&lt;/p&gt;
&lt;p&gt;When the model generates text, it moves through this space by following these
patterns. Humans then read the output and recognise tone, relatedness,
contrast and structure.&lt;/p&gt;
&lt;p&gt;The model is not producing meaning. It is reproducing geometry. We are the
ones interpreting that geometry as meaning.&lt;/p&gt;
&lt;p&gt;It is us that supply the I in AI.&lt;/p&gt;
&lt;p&gt;The model provides structure, but humans provide interpretation.&lt;/p&gt;
&lt;p&gt;This geometric structure is simply a way of organising statistical patterns so
the model can use them efficiently.&lt;/p&gt;
&lt;p&gt;To understand how this internal space is created, we need to look at the
billions of parameters inside the model.&lt;/p&gt;
&lt;h1 id="what-is-in-the-billions-of-parameters"&gt;What is in the billions of parameters&lt;/h1&gt;
&lt;p&gt;To understand how the model builds and moves through its geometric space, it
helps to look at what that is based on.&lt;/p&gt;
&lt;p&gt;After training, an LLM contains billions of parameters. These parameters are
numerical values that shape how the model transforms text. Together they define
the structure of the internal space: the directions that matter, the distances
between tokens, the clusters that form, and the angles that represent
relationships.&lt;/p&gt;
&lt;p&gt;When the model processes a prompt, it moves through this space by following the
statistical structure represented in these parameters.&lt;/p&gt;
&lt;p&gt;DeepSeek R1 has 671 billion parameters. ChatGPT‑5.4 may have over 2 trillion.
More parameters mean greater capacity to represent and combine statistical
patterns.&lt;/p&gt;
&lt;p&gt;More parameters increase capacity, not understanding.&lt;/p&gt;
&lt;h2 id="parameters-do-not-contain-knowledge"&gt;Parameters do not contain knowledge&lt;/h2&gt;
&lt;p&gt;The billions of parameters inside an LLM are often described as if they contain
knowledge. They do not. They represent statistical consistencies extracted from
large amounts of text.&lt;/p&gt;
&lt;p&gt;During training, the model adjusts its parameters to capture patterns in how
language is used. Humans use language in standard ways, directed by grammar,
style, topic associations and the common ways that ideas appear together.&lt;/p&gt;
&lt;p&gt;The parameters form a space where patterns that frequently co‑occur in text end
up close to one another. This allows the model to produce text that resembles
human writing. It does not give the model the ability to reason or understand.&lt;/p&gt;
&lt;p&gt;For example, if the training data contains mixed statements about a historical
date, the model may confidently produce the wrong one because it is reflecting
the statistical blend it has seen.&lt;/p&gt;
&lt;p&gt;Parameters cannot store precise facts. They store tendencies, associations and
relationships. If a fact appears often and consistently in the training data,
the model may reproduce it. If the data is mixed or inconsistent, the model
reflects that uncertainty. This is why LLMs can produce confident errors. They
are not recalling facts. They are replaying patterns.&lt;/p&gt;
&lt;p&gt;These parameters are shaped during training, which is the process that gives
the model its statistical structure.&lt;/p&gt;
&lt;p&gt;The model reflects the patterns in its data, not stored facts or understanding.&lt;/p&gt;
&lt;h1 id="what-training-actually-does"&gt;What training actually does&lt;/h1&gt;
&lt;p&gt;Training is repeated large‑scale error‑correction. The model predicts the next
token, checks whether it was right, and adjusts its parameters to reduce the
difference. This cycle repeats billions of times across vast amounts of text.
The result is a system that becomes increasingly accurate at predicting what
comes next.&lt;/p&gt;
&lt;p&gt;The model does not form concepts. It does not build a picture of the world. It
does not develop intentions or goals. It becomes more accurate at predicting
the next token.&lt;/p&gt;
&lt;p&gt;Fine‑tuning and alignment add further adjustments. These make the model follow
instructions more reliably and avoid harmful output. They do not create
understanding. They refine the statistical patterns the model uses.&lt;/p&gt;
&lt;p&gt;Training shapes the parameters so the model becomes better at predicting what
comes next.&lt;/p&gt;
&lt;h1 id="why-this-is-not-human-learning"&gt;Why this is not human learning&lt;/h1&gt;
&lt;p&gt;Human learning draws on perception, memory, experience and intention. Humans
form abstractions, build mental models and develop goals. Human learning is
grounded in the body and the world.&lt;/p&gt;
&lt;p&gt;LLM training is none of these things. It is a mathematical optimisation
process. The model does not know what it is doing. It does not know that it is
doing anything at all.&lt;/p&gt;
&lt;p&gt;The model’s improvement is mechanical, not cognitive.&lt;/p&gt;
&lt;h1 id="is-the-output-a-simulation-of-intelligence"&gt;Is the output a simulation of intelligence?&lt;/h1&gt;
&lt;p&gt;LLM output can appear intelligent because it resembles the writing of people
who were thinking when they produced the original text. If you ask for advice,
the model generates text that resembles advice. If you ask for an explanation,
it generates text that resembles an explanation. The appearance of reasoning
comes from the patterns in the training data, not from any understanding in the
model. The model produces sequences that look thoughtful because thoughtful
sequences are common in the text it has seen.&lt;/p&gt;
&lt;p&gt;The resemblance is superficial. The model does not understand the text it
produces. It does not know whether a statement is true or false. It only
reflects that certain sequences of tokens tend to follow others.&lt;/p&gt;
&lt;p&gt;The appearance of intelligence comes from the patterns in human writing, not
from the model itself.&lt;/p&gt;
&lt;h1 id="are-humans-interpreting-the-output-as-intelligent"&gt;Are humans interpreting the output as intelligent&lt;/h1&gt;
&lt;p&gt;Humans are skilled at projecting meaning onto language. When we read coherent
text, we assume intention behind it. We assume a mind. We assume agency. This
is a natural response, but it can mislead us when dealing with LLMs.&lt;/p&gt;
&lt;p&gt;The model does not intend anything. It generates plausible continuations of
text. The sense of intelligence comes from the reader, not the machine. The
machine provides form. The human provides interpretation.&lt;/p&gt;
&lt;p&gt;Our instinct to attribute intention makes the output seem smarter than it is.&lt;/p&gt;
&lt;p&gt;This distinction matters because it prevents us from assuming abilities the
model does not have.&lt;/p&gt;
&lt;h1 id="what-this-means-for-us"&gt;What this means for us&lt;/h1&gt;
&lt;p&gt;An LLM is possible because we can statistically model features of language that
matter to humans.&lt;/p&gt;
&lt;p&gt;LLMs are powerful tools for generating language. They are not thinking
machines. Their strengths lie in pattern reproduction. Their weaknesses lie in
the absence of understanding. They can assist with tasks that depend on
language, but they cannot replace human judgement.&lt;/p&gt;
&lt;p&gt;A clear grasp of how these systems work helps avoid confusion. It prevents
anthropomorphism. It supports responsible use. It keeps expectations grounded
in what the technology can actually do, rather than what it appears to do.&lt;/p&gt;
&lt;p&gt;The more plainly we describe these systems, the easier it becomes to use them
well and to avoid treating them as something they are not.&lt;/p&gt;
&lt;p&gt;In the end, an LLM is a system that maps patterns in language and reproduces
them at scale. It does not think or understand. It follows geometry shaped by
training, and we interpret that geometry as meaning. Knowing this helps us use
these systems effectively, without expecting them to behave like people or to
possess abilities they do not have.&lt;/p&gt;
&lt;p&gt;All of this leads to a simple conclusion: understanding these limits helps us
use LLMs effectively and responsibly.&lt;/p&gt;
&lt;h2 id="why-clarity-matters"&gt;Why clarity matters&lt;/h2&gt;
&lt;p&gt;LLMs are powerful because language has structure, not because the systems
understand it. They reproduce patterns we find meaningful, and we supply the
meaning. When we keep that distinction clear, we avoid treating statistical
machinery as a mind, and we avoid outsourcing judgement to a system that has
none. Practical wisdom begins with seeing these systems as they are, not as we
are tempted to imagine them.&lt;/p&gt;
&lt;h1 id="related-work"&gt;Related Work&lt;/h1&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="what-ai-is.html"&gt;A clear explanation of what AI is—and is not—cutting through hype to define its real capabilities and limits.&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="how-to-use.html"&gt;Guidance on using AI safely and effectively, grounded in recent examples of misuse and emerging best practices.&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="evaluate-ai-claims.html"&gt;A framework for evaluating claims made about AI systems, focusing on evidence, capability, and verifiable performance.&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;a id="toc"&gt;&lt;/a&gt;&lt;/p&gt;
&lt;h1 id="table-of-contents"&gt;Table of Contents&lt;/h1&gt;
&lt;div class="toc"&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="#how-large-language-models-actually-work-and-why-they-are-not-miniature-humans"&gt;How large language models actually work, and why they are not miniature humans&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#how-an-llm-processes-what-you-type"&gt;How an LLM processes what you type&lt;/a&gt;&lt;ul&gt;
&lt;li&gt;&lt;a href="#tokens"&gt;Tokens&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#lists-of-numbers-represent-a-point-in-space"&gt;Lists of numbers represent a point in space&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;a href="#moving-about-the-space"&gt;Moving about the space&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#why-this-works"&gt;Why this works&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#it-is-all-in-the-training-data"&gt;It is all in the training data&lt;/a&gt;&lt;ul&gt;
&lt;li&gt;&lt;a href="#the-human-role-in-meaning"&gt;The human role in meaning&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;a href="#we-supply-the-intelligence"&gt;We supply the intelligence&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#what-is-in-the-billions-of-parameters"&gt;What is in the billions of parameters&lt;/a&gt;&lt;ul&gt;
&lt;li&gt;&lt;a href="#parameters-do-not-contain-knowledge"&gt;Parameters do not contain knowledge&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;a href="#what-training-actually-does"&gt;What training actually does&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#why-this-is-not-human-learning"&gt;Why this is not human learning&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#is-the-output-a-simulation-of-intelligence"&gt;Is the output a simulation of intelligence?&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#are-humans-interpreting-the-output-as-intelligent"&gt;Are humans interpreting the output as intelligent&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#what-this-means-for-us"&gt;What this means for us&lt;/a&gt;&lt;ul&gt;
&lt;li&gt;&lt;a href="#why-clarity-matters"&gt;Why clarity matters&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;a href="#related-work"&gt;Related Work&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#table-of-contents"&gt;Table of Contents&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/div&gt;
&lt;script type="text/javascript"&gt;if (!document.getElementById('mathjaxscript_pelican_#%@#$@#')) {
    var align = "center",
        indent = "0em",
        linebreak = "false";

    if (false) {
        align = (screen.width &lt; 768) ? "left" : align;
        indent = (screen.width &lt; 768) ? "0em" : indent;
        linebreak = (screen.width &lt; 768) ? 'true' : linebreak;
    }

    var mathjaxscript = document.createElement('script');
    mathjaxscript.id = 'mathjaxscript_pelican_#%@#$@#';
    mathjaxscript.type = 'text/javascript';
    mathjaxscript.src = 'https://cdnjs.cloudflare.com/ajax/libs/mathjax/2.7.3/latest.js?config=TeX-AMS-MML_HTMLorMML';

    var configscript = document.createElement('script');
    configscript.type = 'text/x-mathjax-config';
    configscript[(window.opera ? "innerHTML" : "text")] =
        "MathJax.Hub.Config({" +
        "    config: ['MMLorHTML.js']," +
        "    TeX: { extensions: ['AMSmath.js','AMSsymbols.js','noErrors.js','noUndefined.js'], equationNumbers: { autoNumber: 'none' } }," +
        "    jax: ['input/TeX','input/MathML','output/HTML-CSS']," +
        "    extensions: ['tex2jax.js','mml2jax.js','MathMenu.js','MathZoom.js']," +
        "    displayAlign: '"+ align +"'," +
        "    displayIndent: '"+ indent +"'," +
        "    showMathMenu: true," +
        "    messageStyle: 'normal'," +
        "    tex2jax: { " +
        "        inlineMath: [ ['\\\\(','\\\\)'] ], " +
        "        displayMath: [ ['$$','$$'] ]," +
        "        processEscapes: true," +
        "        preview: 'TeX'," +
        "    }, " +
        "    'HTML-CSS': { " +
        "        availableFonts: ['STIX', 'TeX']," +
        "        preferredFont: 'STIX'," +
        "        styles: { '.MathJax_Display, .MathJax .mo, .MathJax .mi, .MathJax .mn': {color: 'inherit ! important'} }," +
        "        linebreaks: { automatic: "+ linebreak +", width: '90% container' }," +
        "    }, " +
        "}); " +
        "if ('default' !== 'default') {" +
            "MathJax.Hub.Register.StartupHook('HTML-CSS Jax Ready',function () {" +
                "var VARIANT = MathJax.OutputJax['HTML-CSS'].FONTDATA.VARIANT;" +
                "VARIANT['normal'].fonts.unshift('MathJax_default');" +
                "VARIANT['bold'].fonts.unshift('MathJax_default-bold');" +
                "VARIANT['italic'].fonts.unshift('MathJax_default-italic');" +
                "VARIANT['-tex-mathit'].fonts.unshift('MathJax_default-italic');" +
            "});" +
            "MathJax.Hub.Register.StartupHook('SVG Jax Ready',function () {" +
                "var VARIANT = MathJax.OutputJax.SVG.FONTDATA.VARIANT;" +
                "VARIANT['normal'].fonts.unshift('MathJax_default');" +
                "VARIANT['bold'].fonts.unshift('MathJax_default-bold');" +
                "VARIANT['italic'].fonts.unshift('MathJax_default-italic');" +
                "VARIANT['-tex-mathit'].fonts.unshift('MathJax_default-italic');" +
            "});" +
        "}";

    (document.body || document.getElementsByTagName('head')[0]).appendChild(configscript);
    (document.body || document.getElementsByTagName('head')[0]).appendChild(mathjaxscript);
}
&lt;/script&gt;</content><category term="Foundations"></category></entry><entry><title>10 Everyday AI Workflows That Save Hours</title><link href="https://phroneses.com/articles/foundations/notes/10-things.html" rel="alternate"></link><published>2026-04-26T00:00:00+00:00</published><updated>2026-04-26T00:00:00+00:00</updated><author><name>JH Evans</name></author><id>tag:phroneses.com,2026-04-26:/articles/foundations/notes/10-things.html</id><summary type="html">&lt;p&gt;Ten simple AI workflows that save minutes each day and compound into hours each week, helping people work more efficiently.&lt;/p&gt;</summary><content type="html">&lt;p&gt;&lt;a href="#toc"&gt;Table of contents&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Artificial intelligence is a practical tool that speeds up routine thinking
tasks. These ten workflows show how everyone can use it to save minutes every
day. Those minutes add up into hours each week. And practise will make you
prompt perfect.&lt;/p&gt;
&lt;hr/&gt;
&lt;h2 id="1-turn-messy-notes-into-clean-summaries"&gt;1. Turn messy notes into clean summaries&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Example&lt;/strong&gt;&lt;br/&gt;
You paste a rambling 500‑word meeting transcript. The system produces a clear summary with action points.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Example prompt&lt;/strong&gt;&lt;br/&gt;
"Here are my messy meeting notes. Please summarise the key decisions and list the action items clearly."&lt;/p&gt;
&lt;hr/&gt;
&lt;h2 id="2-draft-emails-from-bullet-points"&gt;2. Draft emails from bullet points&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Example&lt;/strong&gt;&lt;br/&gt;
You write a few rough points. The system turns them into a polished email.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Example prompt&lt;/strong&gt;&lt;br/&gt;
"Turn these bullet points into a polite, professional email: apologise for delay and ask for feedback by this Friday."&lt;/p&gt;
&lt;hr/&gt;
&lt;h2 id="3-explain-complex-topics-in-plain-english"&gt;3. Explain complex topics in plain English&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Example&lt;/strong&gt;&lt;br/&gt;
You paste a confusing medical letter. The system rewrites it in simple, accurate language.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Example prompt&lt;/strong&gt;&lt;br/&gt;
"Rewrite this in plain English for a non‑expert reader. Keep it accurate but simple. Do not add anything to the content."&lt;/p&gt;
&lt;hr/&gt;
&lt;h2 id="4-create-quick-plans-for-travel-meals-or-events"&gt;4. Create quick plans for travel, meals, or events&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Example&lt;/strong&gt;&lt;br/&gt;
You request a two‑day trip plan. The system provides a structured itinerary with alternatives.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Example prompt&lt;/strong&gt;&lt;br/&gt;
"Plan a two‑day trip to Edinburgh with indoor options if it rains. Include timings."&lt;/p&gt;
&lt;hr/&gt;
&lt;h2 id="5-turn-long-articles-into-short-takeaways"&gt;5. Turn long articles into short takeaways&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Example&lt;/strong&gt;&lt;br/&gt;
You paste a long news article. The system produces a five‑point summary.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Example prompt&lt;/strong&gt;&lt;br/&gt;
"Summarise this article into five key points and give me a one‑sentence takeaway."&lt;/p&gt;
&lt;hr/&gt;
&lt;h2 id="6-brainstorm-ideas-when-you-feel-stuck"&gt;6. Brainstorm ideas when you feel stuck&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Example&lt;/strong&gt;&lt;br/&gt;
You need a name for a community newsletter. The system generates several options.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Example prompt&lt;/strong&gt;&lt;br/&gt;
"Give me ten name ideas for a friendly community newsletter about local events."&lt;/p&gt;
&lt;hr/&gt;
&lt;h2 id="7-rewrite-text-in-different-tones"&gt;7. Rewrite text in different tones&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Example&lt;/strong&gt;&lt;br/&gt;
You paste a blunt message. The system rewrites it in a more diplomatic tone.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Example prompt&lt;/strong&gt;&lt;br/&gt;
"Rewrite this message to be polite and constructive while keeping the meaning."&lt;/p&gt;
&lt;hr/&gt;
&lt;h2 id="8-extract-key-information-from-documents"&gt;8. Extract key information from documents&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Example&lt;/strong&gt;&lt;br/&gt;
You upload a contract. The system identifies renewal dates, obligations, and risks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Example prompt&lt;/strong&gt;&lt;br/&gt;
"Extract the key dates, obligations, and cancellation terms from this contract. Do not invent anything. Only use the data I have provided to you."&lt;/p&gt;
&lt;hr/&gt;
&lt;h2 id="9-create-checklists-from-goals"&gt;9. Create checklists from goals&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Example&lt;/strong&gt;&lt;br/&gt;
You want to declutter your house. The system turns this into a room‑by‑room checklist.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Example prompt&lt;/strong&gt;&lt;br/&gt;
"Turn this goal into a step‑by‑step checklist: declutter my entire house this month."&lt;/p&gt;
&lt;hr/&gt;
&lt;h2 id="10-turn-data-into-quick-insights"&gt;10. Turn data into quick insights&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Example&lt;/strong&gt;&lt;br/&gt;
You paste a small spreadsheet of expenses. The system highlights trends and suggests improvements.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Example prompt&lt;/strong&gt;&lt;br/&gt;
"Here is my monthly spending data. Identify trends and suggest three ways to reduce costs. Use only the data I have provided to you."&lt;/p&gt;
&lt;hr/&gt;
&lt;h2 id="conclusion"&gt;Conclusion&lt;/h2&gt;
&lt;p&gt;Begin with one or two workflows and expand from there. Small time savings
accumulate quickly, and these tools can help you stay organised, informed, and
in control.&lt;/p&gt;
&lt;h1 id="related-work"&gt;Related Work&lt;/h1&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="ai-chatbot-prompting.html"&gt;Ten simple AI workflows that save minutes each day and compound into hours each week, helping people work more efficiently.&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="designing-ai-prompts.html"&gt;Modern AI systems require structured, multi‑step prompts that guide planning, critique, and long‑context reasoning.&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="how-to-use.html"&gt;Guidance on using AI safely and effectively, grounded in recent examples of misuse and emerging best practices.&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;a id="toc"&gt;&lt;/a&gt;&lt;/p&gt;
&lt;h1 id="table-of-contents"&gt;Table of contents&lt;/h1&gt;
&lt;div class="toc"&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="#1-turn-messy-notes-into-clean-summaries"&gt;1. Turn messy notes into clean summaries&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#2-draft-emails-from-bullet-points"&gt;2. Draft emails from bullet points&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#3-explain-complex-topics-in-plain-english"&gt;3. Explain complex topics in plain English&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#4-create-quick-plans-for-travel-meals-or-events"&gt;4. Create quick plans for travel, meals, or events&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#5-turn-long-articles-into-short-takeaways"&gt;5. Turn long articles into short takeaways&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#6-brainstorm-ideas-when-you-feel-stuck"&gt;6. Brainstorm ideas when you feel stuck&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#7-rewrite-text-in-different-tones"&gt;7. Rewrite text in different tones&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#8-extract-key-information-from-documents"&gt;8. Extract key information from documents&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#9-create-checklists-from-goals"&gt;9. Create checklists from goals&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#10-turn-data-into-quick-insights"&gt;10. Turn data into quick insights&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#conclusion"&gt;Conclusion&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#related-work"&gt;Related Work&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#table-of-contents"&gt;Table of contents&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/div&gt;</content><category term="Foundations"></category></entry><entry><title>How to Evaluate the Output of an AI Chat Session</title><link href="https://phroneses.com/articles/foundations/notes/evaluate-ai-chatbot.html" rel="alternate"></link><published>2026-04-26T00:00:00+00:00</published><updated>2026-04-26T00:00:00+00:00</updated><author><name>JH Evans</name></author><id>tag:phroneses.com,2026-04-26:/articles/foundations/notes/evaluate-ai-chatbot.html</id><summary type="html">&lt;p&gt;A practical guide to assessing the quality, reliability, and safety of AI chat session outputs.&lt;/p&gt;</summary><content type="html">&lt;p&gt;&lt;a href="#toc"&gt;Table of contents&lt;/a&gt;&lt;/p&gt;
&lt;h1 id="how-to-evaluate-the-output-of-an-ai-chat-session"&gt;How to Evaluate the Output of an AI Chat Session&lt;/h1&gt;
&lt;h2 id="introduction"&gt;Introduction&lt;/h2&gt;
&lt;p&gt;Many people now use chat systems powered by artificial intelligence for writing,
research, planning, or quick explanations. These systems can be helpful, but
their output varies in quality. Some responses are clear and accurate, while
others may be incomplete, misleading, or overly confident. Understanding how to
evaluate what you receive makes the experience more efficient and safer.&lt;/p&gt;
&lt;p&gt;A simple example shows why this matters. Someone might ask a chat system for a
summary of a historical event and receive a clear explanation. The same person
might then ask for a legal interpretation and receive an answer that sounds
confident but is not reliable. The difference is not always obvious from the
tone of the response.&lt;/p&gt;
&lt;h2 id="start-with-the-purpose-of-the-conversation"&gt;Start With the Purpose of the Conversation&lt;/h2&gt;
&lt;p&gt;It helps to keep in mind what you are trying to achieve. A chat system can
produce ideas, drafts, explanations, or examples very quickly. It is less
reliable when the task requires specialist judgement, up‑to‑date facts, or
precise interpretation.&lt;/p&gt;
&lt;p&gt;For instance, asking for help brainstorming a travel itinerary is usually safe.
Asking for a diagnosis based on symptoms is not. The system may sound equally
confident in both cases, so the purpose of the conversation matters.&lt;/p&gt;
&lt;h2 id="check-whether-the-output-matches-the-question"&gt;Check Whether the Output Matches the Question&lt;/h2&gt;
&lt;p&gt;Sometimes a chat system answers a slightly different question from the one you
asked. This can happen when the prompt is broad or when the system tries to
guess your intent.&lt;/p&gt;
&lt;p&gt;A simple way to check is to read the answer and ask whether it addresses the
specific point you raised. If you ask for "three reasons why a bridge design
failed" and receive a general explanation of bridge engineering, the output is
not wrong, but it is not what you asked for.&lt;/p&gt;
&lt;h2 id="look-for-verifiable-details"&gt;Look for Verifiable Details&lt;/h2&gt;
&lt;p&gt;Useful responses often contain information that can be checked. This might be a
definition, a date, a description of a process, or a reference to a known
concept. When a response includes details that can be confirmed, it becomes
easier to judge its reliability.&lt;/p&gt;
&lt;p&gt;For example, if you ask about how a particular sensor works, a good answer might
describe the physical principle behind it. If the answer instead gives vague
phrases such as "advanced technology" or "cutting edge performance", it may not
be providing real information.&lt;/p&gt;
&lt;h2 id="notice-when-the-system-sounds-certain"&gt;Notice When the System Sounds Certain&lt;/h2&gt;
&lt;p&gt;Chat systems often express ideas in a confident tone, even when the underlying
information is uncertain. This is a normal behaviour of the technology, but it
means that confidence should not be taken as a sign of accuracy.&lt;/p&gt;
&lt;p&gt;A relatable example is when someone asks for the opening hours of a local shop.
The system may provide a clear answer, but unless it has access to current
information, the hours may be outdated or incorrect. The tone does not reflect
the reliability.&lt;/p&gt;
&lt;h2 id="compare-the-output-with-what-you-already-know"&gt;Compare the Output With What You Already Know&lt;/h2&gt;
&lt;p&gt;If the response touches on a topic you understand, a quick comparison can reveal
whether the system is on the right track. If something feels inconsistent with
your knowledge, it may be worth checking further.&lt;/p&gt;
&lt;p&gt;For instance, if you ask about a programming concept you use regularly and the
answer describes it in an unfamiliar way, that is a signal to verify the
information.&lt;/p&gt;
&lt;h2 id="ask-for-clarification-or-a-different-angle"&gt;Ask for Clarification or a Different Angle&lt;/h2&gt;
&lt;p&gt;If a response seems incomplete or unclear, asking the system to explain the idea
in a different way can help. Many people find that asking for an example, a
step‑by‑step explanation, or a simpler description reveals whether the system
actually captured the idea.&lt;/p&gt;
&lt;p&gt;A practical example is when someone asks for an explanation of a financial
term. If the first answer feels abstract, asking for "a simple example using
everyday numbers" often makes the concept clearer.&lt;/p&gt;
&lt;h2 id="be-cautious-with-sensitive-or-highimpact-topics"&gt;Be Cautious With Sensitive or High‑Impact Topics&lt;/h2&gt;
&lt;p&gt;Some areas require extra care. These include medical advice, legal
interpretation, financial decisions, and safety‑critical information. Chat
systems can generate plausible text in these areas, but plausibility is not the
same as accuracy.&lt;/p&gt;
&lt;p&gt;A symptom checker example illustrates this. A system may describe a condition in
a way that sounds precise, but it cannot assess real‑world risk or context. In
such cases, the output should be treated as general information, not as a basis
for action.&lt;/p&gt;
&lt;h2 id="look-for-signs-of-fabrication"&gt;Look for Signs of Fabrication&lt;/h2&gt;
&lt;p&gt;Chat systems sometimes produce details that sound real but are not. These may
include invented citations, incorrect statistics, or descriptions of events
that never occurred. This behaviour is not intentional, but it can mislead
readers who assume the information is factual.&lt;/p&gt;
&lt;p&gt;A common example is when someone asks for a reference to a scientific paper and
receives a title and author that look plausible but do not exist. Checking the
reference quickly reveals the issue.&lt;/p&gt;
&lt;h2 id="use-the-system-as-a-tool-not-an-authority"&gt;Use the System as a Tool, Not an Authority&lt;/h2&gt;
&lt;p&gt;A chat system can be a helpful assistant for drafting, exploring ideas, or
learning about a topic. It is less suited to acting as a final source of
truth.  Treating it as a tool rather than an authority helps keep expectations
realistic and reduces the risk of relying on incorrect information.&lt;/p&gt;
&lt;h2 id="conclusion"&gt;Conclusion&lt;/h2&gt;
&lt;p&gt;Evaluating the output of an AI chat session is a practical skill. Paying
attention to the purpose of the conversation, the clarity of the answer, the
presence of verifiable details, and the sensitivity of the topic can make the
experience more effective and safer. With a few simple habits, it becomes
easier to recognise when the system is providing useful insight and when
additional checking is needed.&lt;/p&gt;
&lt;h1 id="related-work"&gt;Related Work&lt;/h1&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="how-to-use.html"&gt;Guidance on using AI safely and effectively, grounded in recent examples of misuse and emerging best practices.&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="how-ai-works.html"&gt;An explanation of how large language models actually function and why they should not be treated as miniature humans.&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="what-ai-is.html"&gt;A clear explanation of what AI is—and is not—cutting through hype to define its real capabilities and limits.&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;a id="toc"&gt;&lt;/a&gt;&lt;/p&gt;
&lt;h1 id="table-of-contents"&gt;Table of Contents&lt;/h1&gt;
&lt;div class="toc"&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="#how-to-evaluate-the-output-of-an-ai-chat-session"&gt;How to Evaluate the Output of an AI Chat Session&lt;/a&gt;&lt;ul&gt;
&lt;li&gt;&lt;a href="#introduction"&gt;Introduction&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#start-with-the-purpose-of-the-conversation"&gt;Start With the Purpose of the Conversation&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#check-whether-the-output-matches-the-question"&gt;Check Whether the Output Matches the Question&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#look-for-verifiable-details"&gt;Look for Verifiable Details&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#notice-when-the-system-sounds-certain"&gt;Notice When the System Sounds Certain&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#compare-the-output-with-what-you-already-know"&gt;Compare the Output With What You Already Know&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#ask-for-clarification-or-a-different-angle"&gt;Ask for Clarification or a Different Angle&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#be-cautious-with-sensitive-or-highimpact-topics"&gt;Be Cautious With Sensitive or High‑Impact Topics&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#look-for-signs-of-fabrication"&gt;Look for Signs of Fabrication&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#use-the-system-as-a-tool-not-an-authority"&gt;Use the System as a Tool, Not an Authority&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#conclusion"&gt;Conclusion&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;a href="#related-work"&gt;Related Work&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#table-of-contents"&gt;Table of Contents&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/div&gt;</content><category term="Foundations"></category></entry><entry><title>How to Use AI Safely and Effectively</title><link href="https://phroneses.com/articles/foundations/notes/how-to-use.html" rel="alternate"></link><published>2026-04-26T00:00:00+00:00</published><updated>2026-04-26T00:00:00+00:00</updated><author><name>JH Evans</name></author><id>tag:phroneses.com,2026-04-26:/articles/foundations/notes/how-to-use.html</id><summary type="html">&lt;p&gt;Guidance on using AI safely and effectively, grounded in recent examples of misuse and emerging best practices.&lt;/p&gt;</summary><content type="html">&lt;p&gt;&lt;a href="#toc#"&gt;Table of contents&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Recent headlines have shown the same unsettling pattern.&lt;/p&gt;
&lt;p&gt;An AI system confidently generated legal cases that never existed, as reported when UK courts received filings built on fictitious case law (The Guardian, Scottish Legal News).&lt;/p&gt;
&lt;p&gt;Health researchers have warned that AI can give medical guidance that is not just inaccurate but dangerously misleading. A British Medical Journal article as reported in the Independent stated that 20% of AI medical answers were "highly problematic".&lt;/p&gt;
&lt;p&gt;And tech reporters have documented AI‑generated news summaries that included entirely fabricated headlines and events (Sky News).&lt;/p&gt;
&lt;p&gt;In every case, the system generated output that communicated total confidence. In every case, the AI was wrong. Fluency is not understanding. Appearing proficient is not accuracy. This confusion is exactly where the real risk lies.&lt;/p&gt;
&lt;h1 id="give-clear-instructions"&gt;Give Clear Instructions&lt;/h1&gt;
&lt;p&gt;AI works best when you tell it exactly what you want. It does not infer your intentions or read between the lines. The output you see is a statistical software prediction based on patterns in the training data of the AI. The clearer your request, the better the output.&lt;/p&gt;
&lt;p&gt;Start by stating your goal. Instead of asking, "Tell me about climate change," try: "Give me a 150‑word summary of the main causes of climate change for a general audience." A specific target gives the system's statistical pattern-matching something concrete to aim at.&lt;/p&gt;
&lt;p&gt;Set the format you want. Simple instructions like "Give me three options," "Write this as a short email," or "List the steps in order" immediately improve the result. Format acts as a constraint, and constraints make the output sharper.&lt;/p&gt;
&lt;p&gt;Define the audience. AI changes tone and detail depending on who you say it is for: beginners, executives, customers, or the general public. A single line about the audience can transform the clarity of the answer.&lt;/p&gt;
&lt;p&gt;If accuracy matters, add constraints such as "Use widely accepted information," "If you’re unsure, say so," or "Do not invent details." These reduce the risk of confident mistakes.&lt;/p&gt;
&lt;p&gt;Clear instructions make the output better and safer, but they do not eliminate the risk of mistakes. Even with perfect prompts, a system can still deliver something that sounds certain but is completely wrong.&lt;/p&gt;
&lt;p&gt;The AI is not weighing evidence or checking facts. AI is programmed to produce an answer that appears most likely based on patterns in its training data. When those patterns point in the wrong direction, the result is a confident mistake. Your prompt has to help the AI navigate any bias or missing data in its training data. Think of your prompt as you nudging the AI in the direction you want to go.&lt;/p&gt;
&lt;p&gt;When your task is large, break it into smaller steps. Ask for an outline first, then expand each section. AI performs far better when guided step‑by‑step.&lt;/p&gt;
&lt;p&gt;Clear instructions don’t just improve the output, they keep you in control of the process.&lt;/p&gt;
&lt;h1 id="provide-enough-context"&gt;Provide Enough Context&lt;/h1&gt;
&lt;p&gt;AI performs noticeably better when it has the background information it needs, such as who the audience is, what the situation involves, or what constraints apply.&lt;/p&gt;
&lt;p&gt;When context is missing, the system often fills in the gaps with incorrect predictions that will look like guesses, and recent reporting shows how easily this can go wrong. The Guardian found that Google AI Overviews gave misleading health advice because the AI responded without understanding the medical circumstances involved, including a case where it advised pancreatic cancer patients to avoid high fat foods, which experts described as really dangerous. This is dangeous advice as some who suffer from pancreatic cancer are malnourished and consuming fat can be a nutritionally efficient way to ingest energy.&lt;/p&gt;
&lt;h1 id="check-the-output-carefully"&gt;Check the Output Carefully&lt;/h1&gt;
&lt;p&gt;AI is not a source of truth, it is a generator of plausible answers, so treat every response as a draft, not a verdict.&lt;/p&gt;
&lt;p&gt;Read the answer to then ask basic questions: Does this match what you already know, does it contradict trusted sources, does anything feel too neat or too extreme?&lt;/p&gt;
&lt;p&gt;For factual topics, spot check key claims against reputable outlets or official documentation, especially numbers, names, dates, web links, and legal or medical details.&lt;/p&gt;
&lt;p&gt;For writing tasks, look for invented quotes, fake references, or details that are oddly specific without any support.&lt;/p&gt;
&lt;p&gt;If something important hinges on the answer, ask the system to show its reasoning, to list uncertainties, or to offer alternative possibilities.&lt;/p&gt;
&lt;p&gt;The core habit is simple: never confuse a confident tone with a reliable answer. Once you see the answer you can ask the AI more questions to check the reliability of that answer. This is especially important if you are going to do something that relies on that answer.&lt;/p&gt;
&lt;h1 id="use-ai-for-the-right-tasks"&gt;Use AI for the Right Tasks&lt;/h1&gt;
&lt;p&gt;AI is most effective when the work involves drafting, summarising, organising ideas, exploring options, or speeding up early stage thinking.&lt;/p&gt;
&lt;p&gt;AI can turn rough notes into a clean paragraph, reshape a long document into a shorter one, or generate several ways to frame a problem so you can choose the best one.&lt;/p&gt;
&lt;p&gt;AI is also useful for outlining reports, comparing approaches, rewriting for different audiences, or helping you see alternatives you might not have considered. These are tasks where speed and structure matter more than perfect accuracy. You can make text accurate later.&lt;/p&gt;
&lt;p&gt;AI is far less reliable when the task requires expert judgment, real world verification, or precise factual detail, so keep it focused on the parts of the job where it can genuinely help rather than the parts where it can get you into trouble.&lt;/p&gt;
&lt;p&gt;Keep in mind that AI is not thinking. AI does not check for truth. It generates plausible text based on its training data.&lt;/p&gt;
&lt;h1 id="avoid-using-ai-for-judgement-or-decisions"&gt;Avoid Using AI for Judgement or Decisions&lt;/h1&gt;
&lt;p&gt;AI cannot weigh values, consequences, or ethics, and it cannot understand the human context that sits behind real decisions.&lt;/p&gt;
&lt;p&gt;AI can offer options, outline trade offs, or summarise information, but it cannot decide what matters most, what is acceptable, or what is fair. Those choices rely on experience, responsibility, and an understanding of people, none of which an AI possesses.&lt;/p&gt;
&lt;p&gt;Use AI to support your thinking, not to replace it. Human judgement must stay in charge, especially when the outcome affects safety, wellbeing, trust, or the outcome has long term consequences.&lt;/p&gt;
&lt;h1 id="be-cautious-with-personal-or-sensitive-information"&gt;Be Cautious with Personal or Sensitive Information&lt;/h1&gt;
&lt;p&gt;Treat AI tools the same way you would treat an online form or an email to someone you do not know.&lt;/p&gt;
&lt;p&gt;Do not share details that could identify you, expose someone else, or create problems if they were ever seen by the wrong person. This includes financial information, medical records, passwords, private conversations, or anything that involves children, colleagues, or business clients.&lt;/p&gt;
&lt;p&gt;Keep the boundary simple. If you would hesitate before typing it into a website, keep it out of an AI prompt. The safest approach is to describe the situation in general terms and remove anything that is not essential to the task. This protects your privacy and prevents sensitive information from being handled in ways you cannot control.&lt;/p&gt;
&lt;h1 id="compare-answers-with-reliable-sources"&gt;Compare Answers with Reliable Sources&lt;/h1&gt;
&lt;p&gt;Treat AI output as a starting point, not a final answer, and cross check anything that matters with sources you trust.&lt;/p&gt;
&lt;p&gt;This is especially important for facts that are time sensitive, technical, or likely to change. A quick comparison with reputable news outlets, official guidance, or well established reference material can reveal errors that are easy to miss when the writing sounds polished.&lt;/p&gt;
&lt;p&gt;This habit is not about distrusting the tool, it is about protecting yourself from mistakes that come from outdated information, missing context, or confident AI guesses. When accuracy matters, a second source is not optional, it is part of the process.&lt;/p&gt;
&lt;h1 id="keep-an-eye-out-for-gaps-or-oddities"&gt;Keep an Eye Out for Gaps or Oddities&lt;/h1&gt;
&lt;p&gt;A useful habit when reading AI generated answers is to notice when something feels slightly off. This might be an explanation that is too vague, a claim that is oddly specific without support, or a confident statement that does not match what you know.&lt;/p&gt;
&lt;p&gt;When you see these signs, pause and ask a follow up question or check the detail elsewhere.&lt;/p&gt;
&lt;p&gt;Recent reporting shows how easily small oddities can signal a deeper problem. The Guardian described how a senior European journalist was suspended after using AI tools to summarise material and then publishing quotes that the people involved had never said. The investigation found dozens of invented statements that looked polished and authoritative but were entirely false, and the journalist admitted he had fallen into the trap of trusting text that only sounded right.&lt;/p&gt;
&lt;p&gt;Examples like this show why readers should stay alert to gaps, inconsistencies, or moments when an answer feels too neat. These are cues to check the AI's output.&lt;/p&gt;
&lt;h1 id="stay-aware-of-the-limits-of-ai"&gt;Stay Aware of the Limits of AI&lt;/h1&gt;
&lt;p&gt;AI does not understand meaning, it has no lived experience, and it cannot draw on intuition or common sense.&lt;/p&gt;
&lt;p&gt;AI works by recognising patterns in data and producing text that fits those patterns, not by grasping the reality behind the words. This means it can miss context, overlook nuance, or present something that sounds authoritative without any understanding.&lt;/p&gt;
&lt;p&gt;AI cannot feel uncertainty, it cannot judge what is important, and it cannot
tell when it has made a mistake. Keeping these limits in mind helps you use the
tool for what it is good at and avoid expecting it to behave like a person.&lt;/p&gt;
&lt;h1 id="related-work"&gt;Related Work&lt;/h1&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="how-ai-works.html"&gt;An explanation of how large language models actually function and why they should not be treated as miniature humans.&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="evaluate-ai-chatbot.html"&gt;A practical guide to assessing the quality, reliability, and safety of AI chat session outputs.&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="ai-chatbot-prompting.html"&gt;Ten simple AI workflows that save minutes each day and compound into hours each week, helping people work more efficiently.&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;a id="toc"&gt;&lt;/a&gt;&lt;/p&gt;
&lt;h1 id="table-of-contents"&gt;Table of Contents&lt;/h1&gt;
&lt;div class="toc"&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="#give-clear-instructions"&gt;Give Clear Instructions&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#provide-enough-context"&gt;Provide Enough Context&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#check-the-output-carefully"&gt;Check the Output Carefully&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#use-ai-for-the-right-tasks"&gt;Use AI for the Right Tasks&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#avoid-using-ai-for-judgement-or-decisions"&gt;Avoid Using AI for Judgement or Decisions&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#be-cautious-with-personal-or-sensitive-information"&gt;Be Cautious with Personal or Sensitive Information&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#compare-answers-with-reliable-sources"&gt;Compare Answers with Reliable Sources&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#keep-an-eye-out-for-gaps-or-oddities"&gt;Keep an Eye Out for Gaps or Oddities&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#stay-aware-of-the-limits-of-ai"&gt;Stay Aware of the Limits of AI&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#related-work"&gt;Related Work&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#table-of-contents"&gt;Table of Contents&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#further-reading"&gt;Further Reading&lt;/a&gt;&lt;ul&gt;
&lt;li&gt;&lt;a href="#fake-legal-cases"&gt;Fake legal cases&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#dangerous-or-misleading-medical-advice"&gt;Dangerous or misleading medical advice&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#fabricated-news-summaries"&gt;Fabricated news summaries&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#misleading-health-advice"&gt;Misleading health advice&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#senior-european-journalist"&gt;Senior European Journalist&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;/div&gt;
&lt;h1 id="further-reading"&gt;Further Reading&lt;/h1&gt;
&lt;h2 id="fake-legal-cases"&gt;Fake legal cases&lt;/h2&gt;
&lt;p&gt;The Guardian — &lt;a href="https://www.theguardian.com/technology/2025/jun/06/high-court-tells-uk-lawyers-to-urgently-stop-misuse-of-ai-in-legal-work"&gt;https://www.theguardian.com/technology/2025/jun/06/high-court-tells-uk-lawyers-to-urgently-stop-misuse-of-ai-in-legal-work&lt;/a&gt; &lt;br/&gt;
Scottish Legal News — &lt;a href="https://www.scottishlegal.com/articles/ai-chatbot-invented-legal-cases-in-taxpayers-failed-appeal-against-hmrc"&gt;https://www.scottishlegal.com/articles/ai-chatbot-invented-legal-cases-in-taxpayers-failed-appeal-against-hmrc&lt;/a&gt;&lt;/p&gt;
&lt;h2 id="dangerous-or-misleading-medical-advice"&gt;Dangerous or misleading medical advice&lt;/h2&gt;
&lt;p&gt;The Independent — &lt;a href="https://www.independent.co.uk/life-style/health-and-families/health-news/chatbots-medical-advice-bmj-study-b2961005.html"&gt;https://www.independent.co.uk/life-style/health-and-families/health-news/chatbots-medical-advice-bmj-study-b2961005.html&lt;/a&gt;&lt;/p&gt;
&lt;h2 id="fabricated-news-summaries"&gt;Fabricated news summaries&lt;/h2&gt;
&lt;p&gt;Sky News — &lt;a href="https://news.sky.com/story/apple-suspends-ai-generated-news-summaries-after-criticism-over-misleading-notifications-13290676"&gt;https://news.sky.com/story/apple-suspends-ai-generated-news-summaries-after-criticism-over-misleading-notifications-13290676&lt;/a&gt;&lt;/p&gt;
&lt;h2 id="misleading-health-advice"&gt;Misleading health advice&lt;/h2&gt;
&lt;p&gt;The Guardian - &lt;a href="https://www.theguardian.com/technology/2026/jan/02/google-ai-overviews-risk-harm-misleading-health-information"&gt;https://www.theguardian.com/technology/2026/jan/02/google-ai-overviews-risk-harm-misleading-health-information&lt;/a&gt;&lt;/p&gt;
&lt;h2 id="senior-european-journalist"&gt;Senior European Journalist&lt;/h2&gt;
&lt;p&gt;The Guardian - &lt;a href="https://www.theguardian.com/technology/2026/mar/20/mediahuis-suspends-senior-journalist-over-ai-generated-quotes?utm_source=copilot.com"&gt;https://www.theguardian.com/technology/2026/mar/20/mediahuis-suspends-senior-journalist-over-ai-generated-quotes?utm_source=copilot.com&lt;/a&gt;&lt;/p&gt;</content><category term="Foundations"></category></entry><entry><title>What AI Is (and Isn't)</title><link href="https://phroneses.com/articles/foundations/notes/what-ai-is.html" rel="alternate"></link><published>2026-04-26T00:00:00+00:00</published><updated>2026-04-26T00:00:00+00:00</updated><author><name>JH Evans</name></author><id>tag:phroneses.com,2026-04-26:/articles/foundations/notes/what-ai-is.html</id><summary type="html">&lt;p&gt;A clear explanation of what AI is—and is not—cutting through hype to define its real capabilities and limits.&lt;/p&gt;</summary><content type="html">&lt;p&gt;&lt;a href="#toc"&gt;Table of contents&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;We have all read the articles about our AI future: "AI will take your job".&lt;/p&gt;
&lt;p&gt;This article takes a different path to explain AI clearly, simply, and honestly.&lt;/p&gt;
&lt;h1 id="a-straightforward-definition-of-ai"&gt;A Straightforward Definition of AI&lt;/h1&gt;
&lt;p&gt;AI software learns patterns from lots of examples. Once it has been exposed to those patterns, it can create new text.&lt;/p&gt;
&lt;p&gt;When you ask something like "What is the weather going to do in Glasgow tomorrow?", the AI does not read the sentence the way a human does. Instead, it turns your words into numbers,&lt;/p&gt;
&lt;p&gt;Using these, the AI programming looks for relationships in the sentence. Words like "weather," "tomorrow," and "Glasgow" stand out because they are the important parts of your question.&lt;/p&gt;
&lt;p&gt;Next, the AI uses the data it was trained on (the examples) to statistically evaluate what your question is about. It does not "understand" the way people do, it just recognises patterns it has seen before.&lt;/p&gt;
&lt;p&gt;To create an answer, the AI predicts what should come next, one token at a time. A token might be a word, part of a word, or punctuation. The AI chooses the most likely next token based on patterns in its training data.&lt;/p&gt;
&lt;p&gt;This statistical selection can look like reasoning, but it is really pattern‑matching. If the AI was never trained on weather‑related information, it would not be able to give you a good answer. There would be no tokens on which to base its output.&lt;/p&gt;
&lt;p&gt;Because weather changes constantly, the AI system accesses real weather data from an external source. This is how it can give you an accurate, up‑to‑date forecast instead of basing its output on general Glasgow weather.&lt;/p&gt;
&lt;p&gt;Finally, the AI program puts everything together: your question, the patterns it has learned, the conversation so far, and the real weather data, to generate the output you see.&lt;/p&gt;
&lt;h1 id="but-is-it-intelligent"&gt;But is it Intelligent?&lt;/h1&gt;
&lt;p&gt;AI might sound intelligent, but it does not have consciousness, intentions, or real understanding. It does not know things or have opinions. All the AI program is doing is
recognising patterns in data and using those patterns to produce output.&lt;/p&gt;
&lt;p&gt;When an AI responds, it is not thinking or wanting anything; it is just following statistical cues from the data it was previously shown.&lt;/p&gt;
&lt;p&gt;AI can be incredibly powerful, but it is still just a tool. It does not think or decide things on its own. It can only work with the patterns and data it has been given.&lt;/p&gt;
&lt;p&gt;The value of AI comes from how people choose to use it, not from any independent ability or intention.&lt;/p&gt;
&lt;p&gt;When you type a message on your phone and it suggests the next word, your phone is not thinking. The program in your phone is suggesting a good possible next word based on patterns it has seen before. AI works the same way, just on a much larger scale.&lt;/p&gt;
&lt;p&gt;AI predicts what could reasonably come next in a sentence, an image, or an answer, using patterns learned from huge amounts of training data. AI can be incredibly helpful, but it is still predicting based on patterns, not understanding the world. Without the huge amounts of data, AI would have no patterns to base an answer on.&lt;/p&gt;
&lt;p&gt;Now that we have covered how AI works, here is what it can actually do well.&lt;/p&gt;
&lt;h1 id="what-ai-is-good-at"&gt;What AI Is Good At&lt;/h1&gt;
&lt;p&gt;As AI is programmed to find patterns in huge amounts of data, an AI can easily take long documents and turn them into shorter versions, based on patterns that produce clearer text.&lt;/p&gt;
&lt;p&gt;AI is great for drafting emails, rewriting paragraphs, producing variations, or helping with early versions of content.&lt;/p&gt;
&lt;p&gt;When the topic is something it has seen many examples of (such as a question about the weather), AI can give fast, reliable answers.&lt;/p&gt;
&lt;p&gt;And the vast amount of data AI is trained on means AIs are great at classification, translation, sorting, and extracting key details from text. AIs have seen so many examples, their statistical prediction can appear like it has vast knowledge. But an AI is only selecting a statistical match.&lt;/p&gt;
&lt;p&gt;AI is good at giving options, exploring possible approaches, and speeding up early‑stage work. But, AI still needs human judgement to decide whether what has been produced is of any value.&lt;/p&gt;
&lt;p&gt;There are also clear limits that are important to understand.&lt;/p&gt;
&lt;h1 id="what-ai-is-not-good-at"&gt;What AI Is Not Good At&lt;/h1&gt;
&lt;p&gt;AI recognises patterns, not ideas. AI does not understand what you type or what
it outputs.&lt;/p&gt;
&lt;p&gt;If your question is vague, emotional, or depends on context only humans share, AI often predicts incorrectly. Such a response is the AI program selecting an incorrect prediction based on its statistics.&lt;/p&gt;
&lt;p&gt;AI cannot weigh consequences, values, ethics, or trade‑offs. It can only follow patterns in data. As it does not understand in the human sense, AI cannot perform judgement. Judgement requires intent, values, responsibility, and lived experience. AI has none of these.&lt;/p&gt;
&lt;p&gt;However, AI can simulate judgement extremely well because it has access to vast patterns of expert reasoning, it can structure arguments, and it can select options based on criteria you give it. But this is not judgment. It is pattern-based statistical selection without understanding.&lt;/p&gt;
&lt;p&gt;AI can remix and generate new combinations, but it does not have taste, purpose, or a point of view.&lt;/p&gt;
&lt;p&gt;Anything involving physical experience, social cues, or human behaviour is outside its reach. If you say, "My car has a flat tyre," a person knows that the car
cannot be driven safely, that to fix it you will need tools and that the fix is inconvenient and messy.&lt;/p&gt;
&lt;p&gt;An AI has never changed a tyre. It does not know weight, effort, or danger. It only has access to what people have written about flat tyres.&lt;/p&gt;
&lt;p&gt;An AI can describe the steps to fix the flat (as a person has written about this in the past and this writing is in the training data), but AI does not understand the situation.&lt;/p&gt;
&lt;p&gt;An AI has no lived experience, so it can miss things a person might notice. If someone says, "I brought a bottle of wine to the dinner," a person knows this is a polite gesture. AI does not know social customs, it only has access to training data about customs written by a person.&lt;/p&gt;
&lt;h1 id="your-ai-does-not-know-anything"&gt;Your AI does not know anything&lt;/h1&gt;
&lt;p&gt;AI can sound confident even when it is completely mistaken, because it does not know what it does not know.&lt;/p&gt;
&lt;p&gt;If you ask for restaurant recommendations in a town that does not exist, some AIs may still try to answer, giving you incorrect information as the town does not exist.&lt;/p&gt;
&lt;p&gt;When an AI lacks information, it cannot feel uncertainty or recognise gaps the way people do, so it simply produces the most plausible‑sounding answer based on the patterns it currently has access to.&lt;/p&gt;
&lt;p&gt;An AI might confidently state that Venus has two moons, or invent a law that does not exist or describe an imaginary species as if it were real. Because AI never checks facts or senses its own limits, its pattern‑filling behaviour leads to "hallucinations," where the AI creates details, sources, or events that sound right but are not true.&lt;/p&gt;
&lt;p&gt;If the training data is thin, biased, or missing, the output will be unreliable, no matter how polished the output looks.&lt;/p&gt;
&lt;p&gt;If you ask an AI about something that barely exists in its training data — say, "What dishes are served at the Spring Feast in Millford Glen?", the AI will not calculate that the place or event is fictional.&lt;/p&gt;
&lt;p&gt;With nothing solid to draw from, the AI's program uses loose patterns and produces something that only sounds right, like "They usually serve herb stew and blossom cakes." The answer feels plausible, but it is really just the AI making a poor prediction because the information is too thin.&lt;/p&gt;
&lt;h1 id="the-biggest-misconceptions-about-ai"&gt;The Biggest Misconceptions About AI&lt;/h1&gt;
&lt;p&gt;Many people believe AI thinks, understands, or decides in the way a person does, but this is not the case. AI does not grasp meaning, hold values, or judge situations. It only reflects patterns in the material it was trained on.&lt;/p&gt;
&lt;p&gt;Another misconception is that AI has reliable knowledge about everything. When information is scarce, it often fills the gaps with predictions that sound believable but are not accurate. AI has access to vast data stores. AI has no knowledge, just data and a program to spot patterns.&lt;/p&gt;
&lt;p&gt;People also assume AI is neutral, yet it inherits the biases and assumptions present in its training data. Some imagine AI as a step toward consciousness, but it has no awareness or sense of self. It is a powerful tool, but still a tool, and it must be used with a clear understanding of its limits.&lt;/p&gt;
&lt;h1 id="how-to-use-ai-safely-and-effectively"&gt;How to Use AI Safely and Effectively&lt;/h1&gt;
&lt;p&gt;Using AI safely and effectively starts with treating it as a helpful assistant rather than an authority. It works best when you give it clear instructions, specific goals, and enough context to guide the response.&lt;/p&gt;
&lt;p&gt;It is important to check the information it provides, especially when accuracy matters, because it can sound confident even when it is mistaken.&lt;/p&gt;
&lt;p&gt;AI is strongest when you use it to explore ideas, draft material, summarise information, or speed up routine tasks, while keeping final judgement for yourself.&lt;/p&gt;
&lt;p&gt;AI can boost your creativity, improve your productivity, and help you think in new ways, as long as you stay aware of its limits and verify anything that needs to be correct.&lt;/p&gt;
&lt;h1 id="what-to-keep-in-mind-about-ai"&gt;What to Keep in Mind About AI&lt;/h1&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;AI recognises patterns but does not understand meaning.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;It predicts what should come next based on data it has seen.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;It is strong at summarising, drafting, sorting, and exploring ideas.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;It struggles with judgement, context, emotions, and real‑world experience.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;It can sound confident even when it is wrong.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;It works best when you guide it, check its output, and stay in control.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;h1 id="a-simple-mental-model-to-remember"&gt;A Simple Mental Model to Remember&lt;/h1&gt;
&lt;p&gt;Think of AI as a very capable assistant that is excellent at helping you
create, explore, and organise ideas, but one that still needs you to guide it
and check its work.&lt;/p&gt;
&lt;p&gt;AI is powerful but not magical. It recognises patterns but does not understand.
You get the best results when you guide it, check its work, and stay in
control.&lt;/p&gt;
&lt;h1 id="related-work"&gt;Related Work&lt;/h1&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="how-ai-works.html"&gt;An explanation of how large language models actually function and why they should not be treated as miniature humans.&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="evaluate-ai-claims.html"&gt;A framework for evaluating claims made about AI systems, focusing on evidence, capability, and verifiable performance.&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="how-to-use.html"&gt;Guidance on using AI safely and effectively, grounded in recent examples of misuse and emerging best practices.&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;a id="toc"&gt;&lt;/a&gt;&lt;/p&gt;
&lt;h1 id="table-of-contents"&gt;Table of Contents&lt;/h1&gt;
&lt;div class="toc"&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="#a-straightforward-definition-of-ai"&gt;A Straightforward Definition of AI&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#but-is-it-intelligent"&gt;But is it Intelligent?&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#what-ai-is-good-at"&gt;What AI Is Good At&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#what-ai-is-not-good-at"&gt;What AI Is Not Good At&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#your-ai-does-not-know-anything"&gt;Your AI does not know anything&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#the-biggest-misconceptions-about-ai"&gt;The Biggest Misconceptions About AI&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#how-to-use-ai-safely-and-effectively"&gt;How to Use AI Safely and Effectively&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#what-to-keep-in-mind-about-ai"&gt;What to Keep in Mind About AI&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#a-simple-mental-model-to-remember"&gt;A Simple Mental Model to Remember&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#related-work"&gt;Related Work&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#table-of-contents"&gt;Table of Contents&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/div&gt;</content><category term="Foundations"></category></entry><entry><title>A Beginner's Guide to AI Chatbot Prompting</title><link href="https://phroneses.com/articles/foundations/notes/ai-chatbot-prompting.html" rel="alternate"></link><published>2026-04-22T00:00:00+00:00</published><updated>2026-04-22T00:00:00+00:00</updated><author><name>JH Evans</name></author><id>tag:phroneses.com,2026-04-22:/articles/foundations/notes/ai-chatbot-prompting.html</id><summary type="html">&lt;p&gt;Clear, practical prompting habits to help you get faster, more reliable results from everyday AI tasks.&lt;/p&gt;</summary><content type="html">&lt;p&gt;&lt;a href="#toc"&gt;Table of contents&lt;/a&gt;&lt;/p&gt;
&lt;h1 id="a-beginners-guide-to-ai-chatbot-prompting"&gt;A Beginner’s Guide to AI Chatbot Prompting&lt;/h1&gt;
&lt;p&gt;This guide gives beginners a clear, practical foundation for working with AI chatbots. Each section focuses on one skill, why it matters, and how to apply it.&lt;/p&gt;
&lt;h2 id="1-what-prompting-is-and-why-it-matters"&gt;1. What Prompting Is and Why It Matters&lt;/h2&gt;
&lt;p&gt;Prompting is the skill of giving clear instructions to a chatbot so that
you are more likely to get a useful response.&lt;/p&gt;
&lt;p&gt;Good prompts will reduce confusion and save you time. A poor
prompt can waste time as you work you way through an answer that does
not hit the spot.&lt;/p&gt;
&lt;p&gt;Example:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Vague: "Explain photosynthesis"&lt;/li&gt;
&lt;li&gt;Clear: "Explain photosynthesis in simple terms for a 12‑year‑old"&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;If you try these you will see that the second one is a completelt different
response from the first. It is more direct and easier to read.&lt;/p&gt;
&lt;h2 id="2-start-with-a-direct-request"&gt;2. Start With a Direct Request&lt;/h2&gt;
&lt;p&gt;A simple, explicit request sets the direction.&lt;/p&gt;
&lt;p&gt;Examples:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;"Write a short summary of this article"&lt;/li&gt;
&lt;li&gt;"Give me three ideas for a birthday message"&lt;/li&gt;
&lt;li&gt;"Explain how this code works"&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;With the short summary prompt, startinmg on a new line, pase in the
article you are referring to.&lt;/p&gt;
&lt;h2 id="3-add-context-to-aim-the-response"&gt;3. Add Context to Aim the Response&lt;/h2&gt;
&lt;p&gt;Context helps the chatbot match your level, purpose, or constraints.&lt;/p&gt;
&lt;p&gt;Examples:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;"I am new to London, UK. Explain what I can do on a wet Sunday."&lt;/li&gt;
&lt;li&gt;"I am preparing for a job interview. Give me sample questions."&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;London, UK is specified to keep the prompt clear as there are many
places in the world called London. How many?&lt;/p&gt;
&lt;p&gt;"Give the total number of places in the world called London, no variants. List the names"&lt;/p&gt;
&lt;h2 id="4-specify-the-format-you-want"&gt;4. Specify the Format You Want&lt;/h2&gt;
&lt;p&gt;Format guides structure and makes the output easier to use.&lt;/p&gt;
&lt;p&gt;Examples:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;"Give me a bullet‑point list"&lt;/li&gt;
&lt;li&gt;"Write a short paragraph"&lt;/li&gt;
&lt;li&gt;"Produce a step‑by‑step explanation"&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="5-set-clear-constraints"&gt;5. Set Clear Constraints&lt;/h2&gt;
&lt;p&gt;Constraints keep the answer focused and predictable.&lt;/p&gt;
&lt;p&gt;Examples:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;"Keep it under 150 words"&lt;/li&gt;
&lt;li&gt;"Use plain English"&lt;/li&gt;
&lt;li&gt;"No jargon"&lt;/li&gt;
&lt;li&gt;"Be concise"&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="6-use-examples-to-anchor-tone-and-style"&gt;6. Use Examples to Anchor Tone and Style&lt;/h2&gt;
&lt;p&gt;Examples show the chatbot what "good" looks like.&lt;/p&gt;
&lt;p&gt;Example:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;"Write it in the style of this: 'Short, direct, and practical.'"&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="7-adjust-over-time-instead-of-restarting"&gt;7. Adjust Over Time Instead of Restarting&lt;/h2&gt;
&lt;p&gt;Treat the chatbot as a collaborator. Adjust the output rather than rewriting the whole prompt.&lt;/p&gt;
&lt;p&gt;Examples:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;"Shorten this"&lt;/li&gt;
&lt;li&gt;"Make it more formal"&lt;/li&gt;
&lt;li&gt;"Add one more example in the first paragraph"&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="8-ask-for-alternatives-when-you-need-options"&gt;8. Ask for Alternatives When You Need Options&lt;/h2&gt;
&lt;p&gt;Variations help you compare and choose.&lt;/p&gt;
&lt;p&gt;Examples:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;"Give me two more options"&lt;/li&gt;
&lt;li&gt;"Rewrite this with a friendlier tone"&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="9-break-complex-tasks-into-steps"&gt;9. Break Complex Tasks Into Steps&lt;/h2&gt;
&lt;p&gt;Step‑by‑step prompting keeps large tasks managoeable.&lt;/p&gt;
&lt;p&gt;AI chatbots are pattern matching. If your prompt is long, the AI may appear 
to skip something you say as it does not have a strong pattern to match to it.&lt;/p&gt;
&lt;p&gt;Example:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;"First, outline the structure. Then we will fill in each section."&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="10-common-mistakes-to-avoid"&gt;10. Common Mistakes to Avoid&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Being too vague&lt;/li&gt;
&lt;li&gt;Asking for everything at once&lt;/li&gt;
&lt;li&gt;Forgetting to specify the audience&lt;/li&gt;
&lt;li&gt;Not having the AI give examples&lt;/li&gt;
&lt;li&gt;Expecting perfect output on the first try&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="11-quick-prompt-templates"&gt;11. Quick Prompt Templates&lt;/h2&gt;
&lt;p&gt;These templates give learners a starting point that you can adapt.&lt;/p&gt;
&lt;h3 id="explain-something"&gt;Explain Something&lt;/h3&gt;
&lt;p&gt;"Explain [topic] to [audience] in [format]. Keep it [constraints]."&lt;/p&gt;
&lt;p&gt;"Explain beaches to a 10 year-old in one pargraph. Keep it positive and clear." &lt;br/&gt;
"Explain beaches to an adult in one pargraph. Keep it positive and clear." &lt;br/&gt;
"Explain beaches."&lt;/p&gt;
&lt;h3 id="rewrite-something"&gt;Rewrite Something&lt;/h3&gt;
&lt;p&gt;"Rewrite this text to be more [tone]. Keep the meaning the same."&lt;/p&gt;
&lt;p&gt;"Give first line of Pride and Prejudice by Jane Austen." &lt;br/&gt;
"Rewrite using corporate speak. Keep the meaning the same but push the buzzwords to 11."&lt;/p&gt;
&lt;h3 id="generate-ideas"&gt;Generate Ideas&lt;/h3&gt;
&lt;p&gt;"Give me [number] ideas for [goal]. Keep them practical."o&lt;/p&gt;
&lt;p&gt;"Give me 5 ideas for walking down the sidewalk. Keep them practical."&lt;/p&gt;
&lt;h3 id="troubleshoot"&gt;Troubleshoot&lt;/h3&gt;
&lt;p&gt;"I am seeing this issue: [a detailed description]. Give me possible causes and simple steps to check."&lt;/p&gt;
&lt;p&gt;"I am seeing this issue: my grass is too yellow. Give me possible causes and simple checks to check."&lt;/p&gt;
&lt;h2 id="12-practice-prompts"&gt;12. Practice Prompts&lt;/h2&gt;
&lt;p&gt;Use these to build confidence and develop prompting habits.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;"Explain how a mortgage works as if I am new to finance."&lt;/li&gt;
&lt;li&gt;"Give me three ways to describe my job in a CV. I have pasted my CV."&lt;/li&gt;
&lt;li&gt;"Summarise the following paragraph in one sentence."&lt;/li&gt;
&lt;li&gt;"Suggest improvements to this email without changing the intent."&lt;/li&gt;
&lt;/ul&gt;
&lt;h1 id="related-work"&gt;Related Work&lt;/h1&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="10-things.html"&gt;Ten simple AI workflows that save minutes each day and compound into hours each week, helping people work more efficiently.&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="designing-ai-prompts.html"&gt;Modern AI systems require structured, multi‑step prompts that guide planning, critique, and long‑context reasoning.&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="evaluate-ai-chatbot.html"&gt;A practical guide to assessing the quality, reliability, and safety of AI chat session outputs.&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;a id="toc"&gt;&lt;/a&gt;&lt;/p&gt;
&lt;h1 id="table-of-contents"&gt;Table of contents&lt;/h1&gt;
&lt;div class="toc"&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="#a-beginners-guide-to-ai-chatbot-prompting"&gt;A Beginner’s Guide to AI Chatbot Prompting&lt;/a&gt;&lt;ul&gt;
&lt;li&gt;&lt;a href="#1-what-prompting-is-and-why-it-matters"&gt;1. What Prompting Is and Why It Matters&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#2-start-with-a-direct-request"&gt;2. Start With a Direct Request&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#3-add-context-to-aim-the-response"&gt;3. Add Context to Aim the Response&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#4-specify-the-format-you-want"&gt;4. Specify the Format You Want&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#5-set-clear-constraints"&gt;5. Set Clear Constraints&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#6-use-examples-to-anchor-tone-and-style"&gt;6. Use Examples to Anchor Tone and Style&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#7-adjust-over-time-instead-of-restarting"&gt;7. Adjust Over Time Instead of Restarting&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#8-ask-for-alternatives-when-you-need-options"&gt;8. Ask for Alternatives When You Need Options&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#9-break-complex-tasks-into-steps"&gt;9. Break Complex Tasks Into Steps&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#10-common-mistakes-to-avoid"&gt;10. Common Mistakes to Avoid&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#11-quick-prompt-templates"&gt;11. Quick Prompt Templates&lt;/a&gt;&lt;ul&gt;
&lt;li&gt;&lt;a href="#explain-something"&gt;Explain Something&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#rewrite-something"&gt;Rewrite Something&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#generate-ideas"&gt;Generate Ideas&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#troubleshoot"&gt;Troubleshoot&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;a href="#12-practice-prompts"&gt;12. Practice Prompts&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;a href="#related-work"&gt;Related Work&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#table-of-contents"&gt;Table of contents&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/div&gt;</content><category term="Foundations"></category></entry><entry><title>How to Evaluate A Company's AI Claims</title><link href="https://phroneses.com/articles/foundations/notes/evaluate-ai-claims.html" rel="alternate"></link><published>2025-01-01T00:00:00+00:00</published><updated>2025-01-01T00:00:00+00:00</updated><author><name>JH Evans</name></author><id>tag:phroneses.com,2025-01-01:/articles/foundations/notes/evaluate-ai-claims.html</id><summary type="html">&lt;p&gt;A framework for evaluating claims made about AI systems, focusing on evidence, capability, and verifiable performance.&lt;/p&gt;</summary><content type="html">&lt;p&gt;&lt;a href="#toc"&gt;Table of contents&lt;/a&gt;&lt;/p&gt;
&lt;h1 id="how-to-evaluate-claims-made-about-an-ai-based-system"&gt;How to Evaluate Claims Made About an AI-based System&lt;/h1&gt;
&lt;h2 id="introduction"&gt;Introduction&lt;/h2&gt;
&lt;p&gt;Artificial intelligence now appears in many areas of daily life. It is used in
search engines, writing tools, customer service systems, healthcare
applications, and many other services. Many people encounter it without
thinking about it, such as when a phone suggests a reply to a message or when
an ecommerce website summarises customer feedback about a product.&lt;/p&gt;
&lt;p&gt;Public descriptions of systems based in part or whole on AI often highlight
ambitious capabilities.  Some describe their products as human level, fully
autonomous, or capable of replacing expert judgement.&lt;/p&gt;
&lt;p&gt;Promotional language and real performance do not always align, which makes it
useful to look closely at how such claims are formed.&lt;/p&gt;
&lt;h2 id="understanding-the-claim"&gt;Understanding the Claim&lt;/h2&gt;
&lt;p&gt;The first step is to understand what is actually being promised.&lt;/p&gt;
&lt;p&gt;Many statements about artificial intelligence are broad or ambiguous, so it is
useful to translate them into specific questions. A claim such as "our tool
detects fraud" sounds clear, but it raises many questions about what kind of
fraud, in what context, and with what level of accuracy.&lt;/p&gt;
&lt;p&gt;Many people begin by considering what task the system is meant to perform,
under what conditions it is expected to work, how well it performs that task,
and what it is being compared against. Once the claim is expressed in concrete
terms, it becomes much easier to evaluate.&lt;/p&gt;
&lt;h2 id="looking-for-evidence"&gt;Looking for Evidence&lt;/h2&gt;
&lt;p&gt;Claims about performance usually rest on some form of evidence. A credible
statement about artificial intelligence is supported by clear information about
how the system was tested.&lt;/p&gt;
&lt;p&gt;Independent evaluations, published research, recognised benchmarks, and real
world trials all provide meaningful support. For example, a reading
comprehension benchmark or a driving simulation can show how a system behaves
under controlled conditions. By contrast, phrases such as "industry leading
accuracy" or "our internal tests show excellent results" offer very little
without further detail.&lt;/p&gt;
&lt;p&gt;Reliability often depends on who carried out the measurement and how the
testing was designed.&lt;/p&gt;
&lt;h2 id="considering-the-data"&gt;Considering the Data&lt;/h2&gt;
&lt;p&gt;Every artificial intelligence system depends heavily on the data used to train
it.&lt;/p&gt;
&lt;p&gt;The quality, diversity, and representativeness of that data shape the system’s
strengths and weaknesses. A photo classifier trained mostly on daytime images
may struggle with night scenes, and a language tool trained mainly on formal
writing may find slang or informal messages difficult to interpret.&lt;/p&gt;
&lt;p&gt;When assessing a claim, it is worth asking whether the data reflects the real
world situations in which the system will be used. Narrow or unrepresentative
data can limit how well the system performs in real situations.&lt;/p&gt;
&lt;h2 id="recognising-limitations"&gt;Recognising Limitations&lt;/h2&gt;
&lt;p&gt;All systems have limitations, and responsible companies acknowledge them.&lt;/p&gt;
&lt;p&gt;It is helpful to look for information about situations where the system
performs poorly, where it may misinterpret inputs, or where it may produce
incorrect or misleading results. A voice assistant that mishears a request
because of background noise is a simple example of how small changes in
context can affect performance.&lt;/p&gt;
&lt;p&gt;Balanced descriptions usually include both strengths and known limitations.&lt;/p&gt;
&lt;h2 id="avoiding-human-like-descriptions-of-ai"&gt;Avoiding Human-like Descriptions of AI&lt;/h2&gt;
&lt;p&gt;Marketing language sometimes presents artificial intelligence in ways that
resemble human thinking.&lt;/p&gt;
&lt;p&gt;Words such as "understands", "reasons", or "knows" can create an impression
that the system possesses abilities it does not have. A system that predicts
the next word in a sentence may appear to "understand" the topic, but it is
following patterns rather than forming ideas.&lt;/p&gt;
&lt;p&gt;A more accurate approach is to focus on what the system actually does, how it
processes inputs, how it generates outputs, and how it behaves under different
conditions.&lt;/p&gt;
&lt;h2 id="seeking-independent-validation"&gt;Seeking Independent Validation&lt;/h2&gt;
&lt;p&gt;Independent evaluations often provide a clearer picture of how a system
performs.&lt;/p&gt;
&lt;p&gt;When researchers, regulators, journalists, or external auditors have examined a
system, their findings provide a valuable counterbalance to promotional
material.&lt;/p&gt;
&lt;p&gt;Real world deployment is equally important. A navigation app may work
perfectly in a staged demonstration, but everyday use can involve roadworks,
poor signal, or unexpected detours that reveal weaknesses.&lt;/p&gt;
&lt;p&gt;Genuine reliability is shown through consistent performance with diverse users
and unpredictable inputs.&lt;/p&gt;
&lt;h2 id="considering-the-consequences-of-error"&gt;Considering the Consequences of Error&lt;/h2&gt;
&lt;p&gt;It is important to consider the consequences of error. Some tasks are low risk,
while others involve significant personal, financial, or social impact.&lt;/p&gt;
&lt;p&gt;A system used for entertainment can tolerate occasional mistakes. A music
recommendation that misses the mark is usually harmless.&lt;/p&gt;
&lt;p&gt;A system used for medical advice, financial decisions, or legal interpretation
requires far stronger evidence and clear safeguards. A symptom checker that
offers an overly confident suggestion illustrates how errors can matter more in
high stakes settings.&lt;/p&gt;
&lt;p&gt;The impact of errors can vary widely, so the way a system handles mistakes
often shapes how it should be used.&lt;/p&gt;
&lt;h2 id="the-importance-of-transparency"&gt;The Importance of Transparency&lt;/h2&gt;
&lt;p&gt;Transparency and accountability are essential qualities.&lt;/p&gt;
&lt;p&gt;Companies who provide clear explanations, publish evaluation results, describe
limitations, and offer channels for feedback demonstrate a commitment to
responsible practice.&lt;/p&gt;
&lt;p&gt;Greater transparency makes it easier to understand how a system works and how
its results should be interpreted. For example, a tool that explains which
factors influenced a recommendation gives users a clearer sense of how to
interpret the output.&lt;/p&gt;
&lt;h2 id="a-practical-way-to-judge-a-claim"&gt;A Practical Way to Judge a Claim&lt;/h2&gt;
&lt;p&gt;These themes often lead people to consider questions about what is being
promised, what evidence supports it, and how the system behaves in real
conditions.&lt;/p&gt;
&lt;p&gt;It is useful to ask what is being promised, what evidence supports the promise,
who carried out the evaluation, what data was used, what limitations are
acknowledged, whether the system has been tested independently, how it performs
outside controlled demonstrations, and what the consequences are if it fails.&lt;/p&gt;
&lt;p&gt;This is a long list, but systems powered in some way by artificial intelligence
are becoming more common and tehy are having a larger impact on everyday life.o&lt;/p&gt;
&lt;p&gt;If we are all better placed to evaluate AI-based systems, the better.&lt;/p&gt;
&lt;p&gt;If several of these questions cannot be answered, any claim is possibly likely
to be overstated.&lt;/p&gt;
&lt;h2 id="conclusion"&gt;Conclusion&lt;/h2&gt;
&lt;p&gt;Artificial intelligence is a powerful set of technologies, but it is not magic.&lt;/p&gt;
&lt;p&gt;Careful consideration and evaluation makes it easier to distinguish genuine
progress from exaggerated claims.&lt;/p&gt;
&lt;h1 id="related-work"&gt;Related Work&lt;/h1&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="how-ai-works.html"&gt;An explanation of how large language models actually function and why they should not be treated as miniature humans.&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="what-ai-is.html"&gt;A clear explanation of what AI is—and is not—cutting through hype to define its real capabilities and limits.&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="evaluate-ai-chatbot.html"&gt;A practical guide to assessing the quality, reliability, and safety of AI chat session outputs.&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;a id="toc"&gt;&lt;/a&gt;&lt;/p&gt;
&lt;h1 id="table-of-contents"&gt;Table of Contents&lt;/h1&gt;
&lt;div class="toc"&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="#how-to-evaluate-claims-made-about-an-ai-based-system"&gt;How to Evaluate Claims Made About an AI-based System&lt;/a&gt;&lt;ul&gt;
&lt;li&gt;&lt;a href="#introduction"&gt;Introduction&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#understanding-the-claim"&gt;Understanding the Claim&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#looking-for-evidence"&gt;Looking for Evidence&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#considering-the-data"&gt;Considering the Data&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#recognising-limitations"&gt;Recognising Limitations&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#avoiding-human-like-descriptions-of-ai"&gt;Avoiding Human-like Descriptions of AI&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#seeking-independent-validation"&gt;Seeking Independent Validation&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#considering-the-consequences-of-error"&gt;Considering the Consequences of Error&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#the-importance-of-transparency"&gt;The Importance of Transparency&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#a-practical-way-to-judge-a-claim"&gt;A Practical Way to Judge a Claim&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#conclusion"&gt;Conclusion&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;a href="#related-work"&gt;Related Work&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="#table-of-contents"&gt;Table of Contents&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/div&gt;</content><category term="Foundations"></category></entry></feed>