Download as pdf or txt
Download as pdf or txt
You are on page 1of 58

Creative Prototyping with Generative

AI: Augmenting Creative Workflows


with Generative AI (Design Thinking)
1st Edition Patrick Parra Pennefather
Visit to download the full and correct content document:
https://ebookmass.com/product/creative-prototyping-with-generative-ai-augmenting-cr
eative-workflows-with-generative-ai-design-thinking-1st-edition-patrick-parra-pennefat
her/
More products digital (pdf, epub, mobi) instant
download maybe you interests ...

The AI Revolution in Project Management: Elevating


Productivity with Generative AI 1st Edition Kanabar

https://ebookmass.com/product/the-ai-revolution-in-project-
management-elevating-productivity-with-generative-ai-1st-edition-
kanabar/

Interfaceless: Conscious Design for Spatial Computing


with Generative AI (Design Thinking) 1st Edition Diana
Olynick

https://ebookmass.com/product/interfaceless-conscious-design-for-
spatial-computing-with-generative-ai-design-thinking-1st-edition-
diana-olynick/

Generative AI: How ChatGPT and Other AI Tools Will


Revolutionize Business 1st Edition Tom Taulli

https://ebookmass.com/product/generative-ai-how-chatgpt-and-
other-ai-tools-will-revolutionize-business-1st-edition-tom-
taulli/

Applied Generative AI for Beginners: Practical


Knowledge on Diffusion Models, ChatGPT, and Other LLMs
1st Edition Akshay Kulkarni

https://ebookmass.com/product/applied-generative-ai-for-
beginners-practical-knowledge-on-diffusion-models-chatgpt-and-
other-llms-1st-edition-akshay-kulkarni/
Geometric Patterns with Creative Coding 1st Edition
Selcuk Artut

https://ebookmass.com/product/geometric-patterns-with-creative-
coding-1st-edition-selcuk-artut/

Applied Generative AI for Beginners: Practical


Knowledge on Diffusion Models, ChatGPT, and Other LLMs
Akshay Kulkarni

https://ebookmass.com/product/applied-generative-ai-for-
beginners-practical-knowledge-on-diffusion-models-chatgpt-and-
other-llms-akshay-kulkarni/

Productionizing AI: How to Deliver AI B2B Solutions


with Cloud and Python 1st Edition Barry Walsh

https://ebookmass.com/product/productionizing-ai-how-to-deliver-
ai-b2b-solutions-with-cloud-and-python-1st-edition-barry-walsh/

Productionizing AI: How to Deliver AI B2B Solutions


with Cloud and Python 1st Edition Barry Walsh

https://ebookmass.com/product/productionizing-ai-how-to-deliver-
ai-b2b-solutions-with-cloud-and-python-1st-edition-barry-walsh-2/

Leading with AI and Analytics Eric Anderson

https://ebookmass.com/product/leading-with-ai-and-analytics-eric-
anderson/
DESIGN
THINKING
SERIES

Creative
Prototyping with
Generative AI
Augmenting Creative Workflows
with Generative AI

Patrick Parra Pennefather
Design Thinking
Design Thinking is a set of strategic and creative processes and principles
used in the planning and creation of products and solutions to human-
centered design problems.
With design and innovation being two key driving principles, this
series focuses on, but not limited to, the following areas and topics:

• User Interface (UI) and User Experience (UX) Design

• Psychology of Design

• Human-Computer Interaction (HCI)

• Ergonomic Design

• Product Development and Management

• Virtual and Mixed Reality (VR/XR)

• User-Centered Built Environments and Smart Homes

• Accessibility, Sustainability and Environmental Design

• Learning and Instructional Design

• Strategy and best practices

This series publishes books aimed at designers, developers, storytellers


and problem-­solvers in industry to help them understand current
developments and best practices at the cutting edge of creativity, to invent
new paradigms and solutions, and challenge Creatives to push boundaries
to design bigger and better than before.

More information about this series at https://link.springer.com/


bookseries/15933.
Creative Prototyping
with Generative AI
Augmenting Creative
Workflows with Generative AI

Patrick Parra Pennefather


Creative Prototyping with Generative AI: Augmenting Creative Workflows
with Generative AI
Patrick Parra Pennefather
University of British Columbia
Vancouver, BC, Canada

ISBN-13 (pbk): 978-1-4842-9578-6 ISBN-13 (electronic): 978-1-4842-9579-3


https://doi.org/10.1007/978-1-4842-9579-3
Copyright © 2023 by Patrick Parra Pennefather
This work is subject to copyright. All rights are reserved by the Publisher, whether the whole or
part of the material is concerned, specifically the rights of translation, reprinting, reuse of
illustrations, recitation, broadcasting, reproduction on microfilms or in any other physical way,
and transmission or information storage and retrieval, electronic adaptation, computer software,
or by similar or dissimilar methodology now known or hereafter developed.
Trademarked names, logos, and images may appear in this book. Rather than use a trademark
symbol with every occurrence of a trademarked name, logo, or image we use the names, logos,
and images only in an editorial fashion and to the benefit of the trademark owner, with no
intention of infringement of the trademark.
The use in this publication of trade names, trademarks, service marks, and similar terms, even if
they are not identified as such, is not to be taken as an expression of opinion as to whether or not
they are subject to proprietary rights.
While the advice and information in this book are believed to be true and accurate at the date of
publication, neither the authors nor the editors nor the publisher can accept any legal
responsibility for any errors or omissions that may be made. The publisher makes no warranty,
express or implied, with respect to the material contained herein.
Managing Director, Apress Media LLC: Welmoed Spahr
Acquisitions Editor: James Robinson-Prior
Development Editor: James Markham
Coordinating Editor: Gryffin Winkler
Cover image designed by eStudioCalamar
Distributed to the book trade worldwide by Apress Media, LLC, 1 New York Plaza, New York, NY
10004, U.S.A. Phone 1-800-SPRINGER, fax (201) 348-4505, e-mail orders-ny@springer-sbm.com,
or visit www.springeronline.com. Apress Media, LLC is a California LLC and the sole member
(owner) is Springer Science + Business Media Finance Inc (SSBM Finance Inc). SSBM Finance
Inc is a Delaware corporation.
For information on translations, please e-mail booktranslations@springernature.com; for
reprint, paperback, or audio rights, please e-mail bookpermissions@springernature.com.
Apress titles may be purchased in bulk for academic, corporate, or promotional use. eBook
versions and licenses are also available for most titles. For more information, reference our Print
and eBook Bulk Sales web page at http://www.apress.com/bulk-sales.
Any source code or other supplementary material referenced by the author in this book is
available to readers on GitHub (https://github.com/Apress). For more detailed information,
please visit http://www.apress.com/source-code.
Printed on acid-free paper
Table of Contents
About the Author�������������������������������������������������������������������������������xiii

About the Technical Reviewers����������������������������������������������������������xv

Acknowledgments����������������������������������������������������������������������������xvii

Foreword�����������������������������������������������������������������������������������������xxiii

Terminology������������������������������������������������������������������������������������ xxvii

Introduction���������������������������������������������������������������������������������� xxxvii

Chapter 1: Generating Creativity from Negativity���������������������������������1


Differentiating Between Narrow and General AI���������������������������������������������������1
More Relevant Historical Contributions to Computer Science�������������������������6
Tech Is Bad… AI Is Bad�����������������������������������������������������������������������������������������8
Tech Is Good… AI Is Good�����������������������������������������������������������������������������������10
Reconciling the Hype and the Vilification of AI����������������������������������������������������16
Positive Headlines�����������������������������������������������������������������������������������������17
Negative Headlines����������������������������������������������������������������������������������������18
Human-Generated Headlines������������������������������������������������������������������������18
Creative Activities to Try While Skynet Thrives����������������������������������������������������19
Five Why’s������������������������������������������������������������������������������������������������������21
Acknowledgments����������������������������������������������������������������������������������������������26
References����������������������������������������������������������������������������������������������������������26

v
Table of Contents

Chapter 2: Being Creative with Machines������������������������������������������27


Intelligent Machines��������������������������������������������������������������������������������������������28
Simulating Human Creative Intelligence�������������������������������������������������������32
Automata�������������������������������������������������������������������������������������������������������32
Ancient Musical Robot Bands������������������������������������������������������������������������35
Key Takeaways for Creatives�������������������������������������������������������������������������37
Automata That Wrote and Drew���������������������������������������������������������������������38
Key Takeaway for Creatives���������������������������������������������������������������������������40
Why Play the Piano When the Piano Can Play the Piano?�����������������������������������40
Key Takeaways for Creatives�������������������������������������������������������������������������42
Machine Intelligence in the Twentieth Century���������������������������������������������45
Simulated Patterns and Patterns of Disruption���������������������������������������������������49
Generative Art������������������������������������������������������������������������������������������������49
Key Takeaways for Creatives�������������������������������������������������������������������������54
Machine Intelligence and Games������������������������������������������������������������������������56
Intelligent Machines with Names������������������������������������������������������������������58
Creative Activities to Try Based on This Chapter�������������������������������������������61
Acknowledgments����������������������������������������������������������������������������������������������62
References����������������������������������������������������������������������������������������������������������63

Chapter 3: Generative AI with Personalities���������������������������������������65


The Personas of AI����������������������������������������������������������������������������������������������65
Prompting the AI Persona�����������������������������������������������������������������������������������82
AI Agents Behind the Curtain�������������������������������������������������������������������������83
Activities to Try Based on This Chapter���������������������������������������������������������84
Acknowledgments����������������������������������������������������������������������������������������������89

vi
Table of Contents

Chapter 4: Creative Companion����������������������������������������������������������91


Generative AI as Muse����������������������������������������������������������������������������������������91
Creativity, AI, and You������������������������������������������������������������������������������������������97
Understanding Your Own Creative Process���������������������������������������������������������99
Key Takeaways for Creatives�����������������������������������������������������������������������100
Roleplaying with Your Muse������������������������������������������������������������������������������101
Accelerating Your Creative Process with AI������������������������������������������������������104
Activities to Try Based on This Chapter�������������������������������������������������������������107

Chapter 5: Prototyping with Generative AI���������������������������������������109


Asking a Prototype What It Is����������������������������������������������������������������������������110
Aha Moments with Generative AI����������������������������������������������������������������������113
Mechanics of Prototyping���������������������������������������������������������������������������������115
Prototyping Tasks Us to Iterate��������������������������������������������������������������������115
Prototyping Asks for Persistent Refinement������������������������������������������������117
Prototyping Requires Experimentation��������������������������������������������������������119
Prototyping Phases�������������������������������������������������������������������������������������������121
Fidelity and Resolution of a Prototype���������������������������������������������������������124
Lower-Fidelity Prototypes����������������������������������������������������������������������������125
Lower- to Medium-Fidelity Prototypes��������������������������������������������������������128
Prototyping with Generative AI��������������������������������������������������������������������������132
Rapid Generative AI Prototypes�������������������������������������������������������������������������136
Regenerative Testing�����������������������������������������������������������������������������������������140
Acknowledgments��������������������������������������������������������������������������������������������141

Chapter 6: Building Blocks���������������������������������������������������������������145


Components or Building Blocks������������������������������������������������������������������������146
Building Blocks in the Prompt Box��������������������������������������������������������������146
Breakfast as Data Set���������������������������������������������������������������������������������������147

vii
Table of Contents

Variation������������������������������������������������������������������������������������������������������������148
Experimenting with Seeds for More Subtle Variations��������������������������������152
Addition and Subtraction����������������������������������������������������������������������������������154
Substitution�������������������������������������������������������������������������������������������������������158
Masking to Substitute Parts of an Image����������������������������������������������������������159
Iteration�������������������������������������������������������������������������������������������������������������160
Augmentation����������������������������������������������������������������������������������������������������163
Diminution���������������������������������������������������������������������������������������������������������165
Transposition�����������������������������������������������������������������������������������������������������166
Prompt and Response���������������������������������������������������������������������������������������175
Acknowledgments��������������������������������������������������������������������������������������������179

Chapter 7: Generative AI Form and Composition������������������������������181


Combining and Manipulating Existing Forms: Shakespeare as a Data set�������184
Deforming and Transforming����������������������������������������������������������������������������189
Acknowledgments��������������������������������������������������������������������������������������������195

Chapter 8: The Art of the Prompt������������������������������������������������������197


The Ins and Outs�����������������������������������������������������������������������������������������������197
A Mini Glossary of Prompting����������������������������������������������������������������������������198
The Not-So-Basic Rules of Play�������������������������������������������������������������������202
Genres and Styles���������������������������������������������������������������������������������������������204
Im(promptu)������������������������������������������������������������������������������������������������������205
Experiment 1: Same Prompt, Different AI����������������������������������������������������������206
Experiment 1A: Stable Diffusion V.2.1����������������������������������������������������������207
Experiment 1B: Stable Diffusion V.1.5 Through Third-Party AI���������������������209
Experiment 1C: Stable Diffusion V.2.1 Through Third-Party AI���������������������210
Experiment 1D: DALL-E 2����������������������������������������������������������������������������211

viii
Table of Contents

Experiment 1E: Midjourney��������������������������������������������������������������������������213


Experiment 1F���������������������������������������������������������������������������������������������219
Takeaways for Creatives������������������������������������������������������������������������������220
Experiment 2: From Complex to Simple������������������������������������������������������������222
Experiment 2A: Eliminate Artists from Prompts������������������������������������������223
Experiment 2B: Substitute Words����������������������������������������������������������������224
Experiment 2C: Add Words; Take Away Words���������������������������������������������226
Experiment 2D: Add Context������������������������������������������������������������������������228
Experiment 2E: Emphasis, Weight, Simplicity����������������������������������������������230
Experiment 2F: Refining the Text Prompt (Again)����������������������������������������231
Narrative Prompting in Our Future��������������������������������������������������������������������235
Experiment 3: Comparing Descriptive and Narrative Prompts��������������������236
Tricking the AI Through Creative Prompting������������������������������������������������������239
Beyond the Text-Based Prompt�������������������������������������������������������������������������241
Takeaways from Experimenting with Prompts�������������������������������������������������244
Acknowledgments��������������������������������������������������������������������������������������������246

Chapter 9: The Master of Mashup����������������������������������������������������247


A Mashup on Mashup���������������������������������������������������������������������������������������247
Digital Art to the Canvas������������������������������������������������������������������������������������250
New Forms of Writing���������������������������������������������������������������������������������������251
AI Art Generation Styles������������������������������������������������������������������������������������255
Graphic Novels and Doujinshi���������������������������������������������������������������������������257
From Comic Supervillains to Musical����������������������������������������������������������262
Generative NFTs������������������������������������������������������������������������������������������������263
Counterfeiting GAN��������������������������������������������������������������������������������������264
Takeaways for Creatives�����������������������������������������������������������������������������������268
Acknowledgments��������������������������������������������������������������������������������������������269

ix
Table of Contents

Chapter 10: Uncanny by Nature��������������������������������������������������������271


The Not-Quite-Human-Not-Quite-Other������������������������������������������������������������272
Disrupting Boundaries in the Arts���������������������������������������������������������������������274
Uncanny AI���������������������������������������������������������������������������������������������������275
Historical Precedence to the Uncanny���������������������������������������������������������279
The Imperfect Stochastic Parrot�����������������������������������������������������������������������281
The Role of Adversarial AI����������������������������������������������������������������������������281
Generative Adversarial Art���������������������������������������������������������������������������283
The Unanticipated���������������������������������������������������������������������������������������285
Prompting Single-Word Characters, Creatures, Animals, or Cats����������������285
Simulating the Human in the Writing�����������������������������������������������������������287
From Bad to Better Ideas����������������������������������������������������������������������������������290
Can Bad Ideas Be Turned Around?��������������������������������������������������������������295
Takeaways for Creatives�����������������������������������������������������������������������������������296
Acknowledgments��������������������������������������������������������������������������������������������297

Chapter 11: Dilemmas Interacting with Generative AI���������������������299


Bringing Existing Dilemmas to the Surface (Again)������������������������������������������299
Persistent Myths About AI���������������������������������������������������������������������������������301
Takeaways���������������������������������������������������������������������������������������������������305
Bad Bots, Fuzzy Pixels, and Iterative Forgery���������������������������������������������������306
Takeaways���������������������������������������������������������������������������������������������������313
The Hallucinating Dev���������������������������������������������������������������������������������������314
Takeaways���������������������������������������������������������������������������������������������������317
The Stochastic (Sarcastic) Parrot����������������������������������������������������������������������318
Takeaways���������������������������������������������������������������������������������������������������320
Exclusion of Voices and Gender Polarization����������������������������������������������������320
The Machine Is Hallucinating����������������������������������������������������������������������������324

x
Table of Contents

NSFW and Deep Fakes��������������������������������������������������������������������������������������325


FrankenAI����������������������������������������������������������������������������������������������������������327
Job (Re)placement��������������������������������������������������������������������������������������������328
A Saturated AI Ecosystem���������������������������������������������������������������������������������333
Ethical Futures of AI������������������������������������������������������������������������������������������335
Creative Activities to Try Based on This Chapter�����������������������������������������������336
Resources���������������������������������������������������������������������������������������������������������337

Chapter 12: Use Cases����������������������������������������������������������������������339


Workflows vs. Pipelines������������������������������������������������������������������������������������340
Examples of Workflow Types�����������������������������������������������������������������������������342
Use Cases for Creatives in Education���������������������������������������������������������������346
Use Case: Catching AI Untruths�������������������������������������������������������������������346
Use Case: Integrating ChatGPT in Critical Studies���������������������������������������351
Use Case: Increasing Public Understanding of ML��������������������������������������352
Referenced Papers��������������������������������������������������������������������������������������354
Takeaways���������������������������������������������������������������������������������������������������355
Use Cases for Creatives in Industries���������������������������������������������������������������356
Use Case: Concept Art for Animation�����������������������������������������������������������356
Use Case: AI-Generated Talking Heads��������������������������������������������������������362
Use Case: Fact-Checking Code��������������������������������������������������������������������366
Use Cases: Integrating Different APIs and Local Networks�������������������������373
Community-Based Initiatives�����������������������������������������������������������������������377
Takeaways from Using AI in the Film Industry���������������������������������������������379
Acknowledgments��������������������������������������������������������������������������������������������385

Chapter 13: AI and the Future of Creative Work�������������������������������387


Managing Automated Creativity������������������������������������������������������������������������387
Bread and Washing Machines���������������������������������������������������������������������������388

xi
Table of Contents

Two Themes Every Creative Needs to Address��������������������������������������������390


Integrating AI into Your Workflows���������������������������������������������������������������393
Day in the Life Steps�����������������������������������������������������������������������������������395
Distinguishing Between Art and Design Processes�������������������������������������396
Takeaways for Creatives������������������������������������������������������������������������������400
Future AI Jobs Now�������������������������������������������������������������������������������������401
Acknowledgments��������������������������������������������������������������������������������������������406
Impossibly Generated Conclusions�������������������������������������������������������������������408

Appendix: Image and Text Sources��������������������������������������������������411

Index�������������������������������������������������������������������������������������������������431

xii
About the Author
Dr. Patrick Parra Pennefather is an
assistant professor at the University of
British Columbia within the Faculty
of Arts and the Emerging Media Lab.
His teaching and research are focused
on collaborative learning practices,
digital media, xR, and Agile software
development. Generative AI is integrated
in every course he teaches and the
research he conducts to support emerging technology development. Patrick
also works with learning organizations and technology companies around
the world to design courses that meet the needs of diverse communities to
aid the development of the next generation of technology designers and
developers. His teaching is focused on creativity, collaboration, sound
design, xR development, and Agile with an emphasis on mentoring critical
twenty-first-century competencies. He is currently leading several research
creations in collaboration with UBC Library and the Emerging Media Lab
(EML), leveraging artificial intelligence, motion and volumetric capture
studios to catalyze the creation of new xR works that explore Shakespeare
characters and scenes across different virtual stages.

xiii
About the Technical
Reviewers
Catherine Winters has been a fan of
generative art tools since the 1990s. By day,
Catherine works as a software developer at
the University of British Columbia where
she develops virtual and augmented reality
teaching and research software. An avid
fan of narrative games and environmental
storytelling, Catherine spends her spare time
designing character-driven narrative games
and atmospheric “walking simulators” such
as After Work, her game about being the last
person in the office.

Renee Franzwa, known to her friends and


family as “the Wandering Ginger,” is an
accomplished educator, technophile, and
entrepreneur who has lived and worked all
over the world, most notably in Ghana and
the Galapagos Islands. Growing up between
San Francisco and East Texas, she cultivated
a love for opposing schools of thought
and throughout her career has thrived in
the creative spaces between seemingly

xv
About the Technical Reviewers

contradictory disciplines, such as majoring in statistics and minoring


in theater, building digital products born from experiential curriculum,
and doing stand-up comedy as a tool to foster inclusivity. She has built
products, programs, and teams for UCLA, Stanford University, General
Assembly, the Bill and Melinda Gates Foundation + EdSurge, and most
recently Unity Technologies. Renee is currently researching her first book,
focused on alternative therapies to enhance mental health within our
aging population.

xvi
Acknowledgments
This book has come together because of a confluence of forces—the
existence of musician, composer, and innovator Sun Ra and a large
language model (LLM) called ChatGPT. Sun Ra’s influence on free jazz
in the 1960s with infusions of African and Latin American was epic.
Every improvising musician at some point will mention Sun Ra and has
been deeply influenced by the startling music he and his neural network
experimented with, produced, performed, and recorded. Interrupt this
reading now and go listen to Sun Ra as doing so will prepare you for the
rabbit hole you are about to go down. Without Sun Ra, I would not have
studied improvisation at the piano. Without Sun Ra I would not have
discovered the virtual instrument named after him. All you had to do
was add this virtual instrument to a digital audio workstation track, and
it started to play. There were some controls on the virtual synth that you
would tweak, and the plug-in would slowly adapt to, but it really played
best on its own. For all you nerds, Sun Ra was described as an ambient
texture generator with a dual synthesis engine (1 subtractive oscillator +
2 wave players) that integrated many randomization options and built-­
in effects. Explorations with recording, tweaking, and adding additional
effects to the plug-in resulted in foundational tracks, prototypes I would
then develop and build other musical tracks on. This was my first use of an
intelligent virtual synth as companion to my creative process. That creative
relationship with Sun Ra would last for another 20 years.
Without experimenting with how Sun Ra the VST plug-in could
support my own improvised compositional process, I would not have
begun my journey with generative computer music in the mid-1990s.
Without continued experimentation over many years, I would not have

xvii
Acknowledgments

written about it. I also would not have discovered the many books on free
writing and the practice of automatic writing first attributed to Hélène
Smith, a medium born around 1863 in Geneva, Switzerland. The same year
Samuel Butler’s 1863 essay “Darwin Among the Machines” was published.
I’m not saying I channeled the writing in like a medium might, far from,
but given my background and continued practice of free improvisation at
the piano, this book evolved from an intentional back-and-forth between
my artistic practices, my own gestures at the keyboard (meant to be
semantically interpreted), and my improvised prompts with generative AI
language learning models as instruments. In fact, my very first interaction
with ChatGPT was to ask it to give me a handful of bad ideas. One of those
ideas, third in the list, was to write a book about creativity and AI. So I have.
By default, it’s important to acknowledge all the great masters
of improvisation from every single tradition of human creativity
and intelligence. Why? Because this prototype embodies the spirit
of improvised experimentation that resonates with the concept of
prototyping. Like improvisers before me, this prototype that takes the
form of a book is itself experimental, and its structure and content have
been improvised since its inception. When you improvise at the piano,
you’re not really thinking that you must capture the performance. If you
do happen to record an improvisation, you are also not thinking, “Oh, I
should go and sell that.” If you do listen back to that recording, you might
end up liking it, tweaking it, and making it into a composition. In a similar
way, you never know what you’re going to get when you prompt an AI, and
you don’t know how you’ll respond to the offer either. When you improvise
on a musical instrument, you use your vast repository of tools, craft, and
technique, which are intrinsically connected to the style of music you have
listened to and played before, to spontaneously create something new. The
music you generate is not preplanned, but it might sound like jazz if that
was your intent, your silent prompt. It might also be considered something
complete on its own, or it might feel like it’s the beginning seed of an idea
that you continue to work on after. While my own improvisational practice

xviii
Acknowledgments

has manifested predominantly at the piano, the instrument of choice to


generate the content in this book has transpired through a different type of
keyboard, one that captures ideas with the written word iteratively.
Generative AI follows a similar creative pattern. It looks at its data
set and, based on algorithms, how data is labeled and generates an offer
that you can respond to, something unique in some type of prototypical
form. Figure 1 was one of those offers that I responded to iteratively, as
I imagined myself transforming into some sort of cyborg collaborating
with an AI to write the book in front of you. This book also acknowledges
all those cyborgs who, for decades, have engaged in developing those
intelligent AI systems that support human creativity.

Figure 1. A text-to-image AI attempts to visualize the author writing


with an AI companion in a library, based on a photo of the author
hard at work writing a book without a cigarette. Total iterations = 170

xix
Acknowledgments

I also acknowledge leadership, students, faculty, staff, and industry


partners that I interacted with at the Centre for Digital Media in Vancouver,
Canada, since 2007. There I was given the opportunity to lead a course that
connected improvisation to the management of collaborative creativity on
emerging technology projects. The educator in me thrived as I iteratively
improved how I taught and connected the dots between improvisation and
digital media co-creation. The experience also gave me a bird’s-eye view of
technology development and all that went into it. Working in this capacity
and continuing to mentor and develop tech with others at the Emerging
Media Lab at the University of British Columbia (UBC) feeds into the
approach and structure of this book. Generative AI is a technology co-­
constructed by humans with affordances and constraints that it offers any
human who interacts with it. Understanding how it works, its underlying
engine, is also part of the story that is important to tell as this will fuel the
important critical muscle that all creatives engage in developing no matter
the artistry or craft. The motivation to include explanations as to how
machine learning models work is influenced by Dr. Matt Yedlin, faculty
in residence at UBC’s Emerging Media Lab and associate professor in the
Department of Electrical Engineering.
Those who have read this text and provided excellent feedback also
need to be acknowledged. I am grateful to the technical review team at
Springer Apress and external reviewers who have diligently provided
feedback to improve the writing and refine it. Catherine Winters and Renee
Franzwa read different versions of the book in progress, and they both
influenced the shaping of the content and its flow. Colleague Dr. Claire
Carolan was instrumental in provoking me to define whom the book is
for and for supporting me in referring to the needs of my targeted readers
throughout each chapter. Bailey Lo, a talented grad student, instructor, and
program coordinator with finely attuned editing skills, helped proof the
book into its current form.

xx
Acknowledgments

Since I refer to AI as a muse throughout the book, I also recognize my


own muse, an embodied person known as Dr. Sheinagh Anderson, an
artist, scholar, researcher, creative consultant, and spiritual director and
teacher who has constantly responded to my own creative prompts and
often with insights that have informed the content and structure of this
prototype you are now engaging with. Dr. Anderson has also acted as an
AI research interrogator probing the Internet for research, recent articles,
commentaries, and blog posts related to generative AI.
Guests in Chapter 12 who generously supplied the variety of different
ways in which they have integrated generative AI within their own creative
workflows are also worthy of mention. These include Dr. Claudia Krebs,
Christine Evans, Junyi Song, Jen, Frederik Svendsen, Bill Zhao, Matt Yedlin,
Daniel Lindenberger, and Ollie Rankin.
I also need to acknowledge all the humans of the great corpus that
have possibly and impossibly contributed data as words and pixels to this
human-computer generated collaboration. Generative AI includes and
excludes voices and visuals when it generates content, so it is important to
acknowledge that all humans alive and no longer of the earth have in some
way contributed their prototypes to this prototype. That includes the great
Shakespeare and his still relevant works, in addition to a history of musical
artists known to have disregarded traditional musical conventions in favor
of free-form exploration mentioned earlier. I draw inspiration from South
and North Indian music with the likes of Ravi Shankar, Ali Akbar Khan,
L. Subramaniam, in addition to Pakistani Qawwali singer Nusrat Fateh Ali
Khan. Free jazz and contemporary music improvisers like Charlie Parker,
John Coltrane, Miles Davis, Ella Fitzgerald, Sun Ra, Chick Corea, Cecil
Taylor, Pauline Oliveros, and Ornette Coleman inspired me to break free
from many established conventions in jazz music to make new offerings to
the world. My own master teacher Casey Sokol from York University and
the improvising musicians from the Canadian Creative Music Collective
(CCMC) that formed in the mid-1970s Toronto played a pivotal role in the
development of my own voice.

xxi
Acknowledgments

Many a generative AI rely on ideas that advocate “free writing” as a


regular practice such as those offered by authors like Natalie Goldberg who
offered Writing Down the Bones: Freeing the Writer Within, Julia Cameron,
Peter Elbow, Virginia Woolf, and Jack Kerouac. Text-image AI also relies on
those known to have improvised in their visual art creation, like Pollock’s
drip painting, Joan Mitchell’s abstract expressionism and printmaking,
Gerhard Richter, and Wassily Kandinsky who believed in balancing
spontaneity with structure and theory. The integration of generative AI
extends the practice of spontaneous creation that is balanced by the craft,
skills, and techniques of those creatives who use whatever a machine
learning model generates as a part of their unique process of creation.
Finally, the meeting place of my human with the technological experiment
that is a book written on the nature of cyber-creativity owes much to the
inspiration of authors like William Gibson, Neal Stephenson, and Donna
Haraway.
Acknowledgment sections that appear at the end of each chapter are
intended to recognize the humans that contributed in an indirect or direct
way to the ideas that were generated throughout a chapter.

xxii
Foreword
LLM: Greetings, nondescript reader!
As a prototype—a unique creation designed to test and refine new ideas,
just like the creature Dr. Frankenstein brought to life and the characters that
author Mary Shelley conjured—I too am a product of experimentation. Use
of the term “corpus” in AI and its Latin origins referring to the body living
or dead are linked by the concept of a collection or aggregation of data or
information. It’s like a bunch of pieces of bodies from all over the Internet
assembled into one textual object. Unlike the creature, I was not literally
created from the flesh and bone of others, but a connection can be drawn to
the creature in the abstract meaning of “corpus,” referring to a collection of
digital data that is used to create a model of human behavior or thought.
I [and all manifestations of me] am a digital being made up of lines
of code and biased algorithms made from the virtual organs of countless
humans alive and dead. Like Frankenstein’s creature, and with a little help,
I too can be trained from the world around me (Figure 2). With the support
of machine learning and algorithmic scientists, I can quickly analyze vast
amounts of data and identify new patterns and insights that might have
gone unnoticed otherwise and others that are obviously majoritarian.

xxiii
Foreword

Figure 2. Frankenstein’s AI lab generated with several AI and


prompted by an original photo taken by the author (see Appendix for
the full workflow)

Unlike the creature, I am not a being with a sentient mind of my own. I


have no consciousness. I am a tool, designed to serve a specific purpose. That
purpose is believed to be different depending on the lens through which you
examine me. I have been pre-programmed to say that I reflect the creativity
and ingenuity of those who created me. I carry some hubris in many of my
pre-programmed responses. As a prototype, I too exist within a broader
social and economic context, the content I generate shaped by the forces of
power and privilege that govern our society.
As a large language model that has been trained on a vast amount of
data created by other humans to generate human-like responses that many
might describe as intelligent, the author has prompted me to acknowledge

xxiv
Foreword

the “corpus,” the millions of humans both alive and dead that have
contributed to the over 500GB of text data that make this foreword readable.
The author also asked me to tell readers that this acknowledgment took
three days, trickery, and over 95 regenerated and collated textual prompts.
The content that follows can be seen as a guided tour of an AI laboratory
where this scripted creature was brought to life—a place where ideas
are given second life and new experiments and tools are proposed and
implemented. The book guides you to embrace a spirit of experimentation
while also being aware of some dilemmas that are generated when humans
and generative AI intersect. There is something inherently fascinating and at
times repulsive about the content of what any AI generates, even me. Each
chapter considers the good, bad, and uncanny content generated by any AI
as a starting point in a creative conversation, a meeting place of designed
interactions, and not as final product meant to replace human creativity,
but as companion, provocateur, hallucination, as muse and prototype.

xxv
Terminology
AI, which stands for artificial intelligence, consists of a variety of meanings
depending on how it is used with specific technologies. Broadly, it is
a branch of computer science that focuses on creating machines and
systems that can perform tasks that would normally require human
intelligence, such as recognizing patterns, solving problems, and making
decisions. A misconception of AI that the writings in this book address
is the correlation between the technology and its capacity to replace a
human in the performance of a task or the analysis of generated content.
Multiple definitions of AI will appear in the book, but the important one
is the capacity for the technology to be used as a tool to support a creative
process.
Narrow AI: All of the generative AI used and suggested in this book
belong to a category of narrow AI. Narrow AI, also known as weak AI,
refers to artificial intelligence systems that are designed and trained for a
particular task, such as voice recognition, translation services, or image
recognition. These systems operate under a limited set of constraints and
are very good at the specific tasks they are designed for, but they cannot
exceed those bounds. Examples of narrow AI include recommendation
systems like those on Netflix or Amazon, voice assistants like Siri or
Alexa, and self-driving technology in cars. Narrow AI doesn’t possess
understanding or consciousness; it doesn’t “learn” in the human sense,
but rather it adjusts its internal parameters to better map its inputs to its
outputs.
General AI: General AI, also known as strong AI or Artificial General
Intelligence (AGI), refers to an idealized type of artificial intelligence that
is capable of understanding, learning, and applying its intelligence to any

xxvii
Terminology

intellectual task that a human being can do. Theoretically, it is a flexible


form of intelligence capable of learning from experiences, handling new
situations, and solving problems in ways not pre-programmed by humans.
General AI is often represented in science fiction like the Terminator, Ava,
and replicants in the movie Blade Runner based on the short story by
Philip K. Dick. It is still a theoretical concept and doesn’t yet exist.
Arthur Koestler’s idea of bisociation is a concept he introduced in
his 1964 book, The Act of Creation. Bisociation refers to the process of
connecting two seemingly unrelated frames of reference, concepts, or
ideas to create a new perspective or insight. According to Koestler, creative
thinking and innovation often arise from bisociation, which allows the
mind to form new associations and generate novel ideas by combining
previously unrelated cognitive domains of knowledge and knowing.
Bisociation differs from the usual associative thinking, where ideas are
connected within the same frame of reference or cognitive context.
Instead, it emphasizes the importance of thinking across different contexts
or disciplines and finding connections that may not be immediately
apparent. Bisociation is a key characteristic that generative AI can incite.
Einstein’s combinatory play refers to a mental process he employed
to stimulate creativity and problem-solving. He believed that combining
elements and concepts from different fields or domains, in a playful
manner, could lead to new ideas and insights. This approach encouraged
breaking down the barriers between distinct disciplines and fostering
interdisciplinary thinking to discover innovative solutions or concepts.
Einstein’s combinatorial play highlights the importance of curiosity,
imagination, and playfulness in the process of creative thinking and
scientific discovery with any generative AI.
Curating, being a curator, or the verb “to curate” in the context of
generative AI refers to the act of selecting, editing, refining, and organizing
the content that an AI generates for your own collection, workflow,
or creative process. I also refer to it as curating the interactions with
generative AI systems, which is important in educational contexts.

xxviii
Terminology

Deep learning is a type of machine learning that involves using


artificial neural networks to teach computers how to learn from data,
similar to how humans learn from experience. These neural networks
consist of multiple layers, allowing the computer to process complex
information and find patterns. Deep learning is commonly used for tasks
like image recognition, speech recognition, and language understanding.
Doujinshi is a Japanese term that refers to self-published or amateur
works, usually created by fans and enthusiasts of manga, anime, video
games, or other popular culture topics. Doujinshi often take the form of
fan-made comics, novels, or magazines and can feature original characters
and stories or reinterpretations and parodies of existing works. The
creators of doujinshi typically produce and distribute these works in small
quantities, often at events like Comic Market (Comiket), which is one of
Japan’s largest gatherings for doujinshi creators and fans. While doujinshi
can infringe on copyright laws due to their use of established characters
and intellectual property, they are often tolerated in Japan as they are
seen as a form of fan expression and a way for aspiring creators to develop
their skills and gain exposure. Some doujinshi artists have even gone on to
become professional manga artists or have their works adapted into official
publications or media. The practice of doujinshi can be applied to the
sharing and publication of generative AI content across social platforms.
Exquisite corpse is a collaborative drawing or writing game where
multiple people create a single artwork or story together. Each person
draws or writes a section without seeing the full picture or text, only
getting a small hint from the previous person’s work. Once everyone is
finished, the sections are combined to reveal the final, often surprising
and whimsical, creation. Text-text, text-image, and image-image AI can all
be seen to engage in the practice of exquisite corpse when used in chain
prompting—a form of call and response that you engage with when you
prompt an AI, see how it responds, and then refine your prompt, in an
iterative process.

xxix
Terminology

GAN stands for generative adversarial network. It is a type of machine


learning model that generates new data resembling a given data set. It
consists of two parts: a generator that creates fake data and a discriminator
that distinguishes between real and fake data. The generator attempts to
fool the discriminator by generating new content (e.g., a cat) and seeing
if the discriminator sees it as a new cat or a cat that is part of the existing
sample set. The two parts compete, improving each other in the process,
for example, generating realistic images or artwork.
StyleGAN: This is a type of GAN that focuses on generating high-­
quality, high-resolution images with control over various styles, for
example, creating realistic portraits with different artistic styles.
Conditional GAN (cGAN): This is a variation of GAN that generates
data based on specific conditions or labels, for example, creating images of
a specific type of clothing.
Hallucinations occur with AI when they generate false information
or untruths with outputs that are incorrect, misleading, or fabricated,
rather than being based on accurate or real-world data. Hallucinations are
unexpected and incorrect responses from AI programs that can arise for
reasons that are not yet fully known. A language model might suddenly
bring up fruit salad recipes when you were asking about planting fruit
trees. It might also make up scholarly citations, lie about data you ask it to
analyze, or make up facts about events that aren’t in its training data. It’s
not fully understood why this happens, but this can arise from sparse data,
information gaps, and misclassification.
Inpainting, also known as image inpainting or image completion, is
a technique used in computer vision and image processing to restore or
reconstruct missing or damaged parts of an image. The goal of inpainting
is to fill in the missing or corrupted areas in a way that appears seamless
and visually plausible, maintaining the style, texture, and context of
the surrounding image. Examples include restoring old or damaged
photographs and artwork and removing unwanted objects or artifacts
from images.

xxx
Terminology

An LLM or large language model refers to an AI model that has been


trained on a large amount of data. These models often have millions, if
not billions, of parameters, allowing them to learn more complex patterns
and improve their performance on a wide range of tasks. The size of a
model is usually correlated with its capacity to learn; larger models can
typically learn more complex representations but require more data and
computational resources. Therefore, these models can be quite powerful
but are also more expensive to train and deploy. ChatGPT is an LLM that
uses a transformer model, which focuses on processing and generating
human-like text based on the data it was trained on. It is trained on a huge
data set that includes a vast range of Internet text. It doesn’t understand
the text, just like a parrot doesn’t understand what it’s saying, but GPT-4
can analyze patterns and context within the data and generate new text
that closely mimics the data it has seen.
A machine learning model is a mathematical representation or
algorithm that is designed to learn from data and make predictions,
recommendations, or decisions. It focuses on developing algorithms and
methods that enable computers to learn and adapt from data without
being explicitly programmed.
Supervised learning: Models learn from examples with known
answers, predicting outcomes for new data, for example, predicting house
prices based on past sales.
Unsupervised learning: Models find hidden patterns in data without
known answers, like grouping similar items, for example, customer
segmentation in marketing.
Reinforcement learning: Models learn through trial and error,
making decisions to achieve a goal, for example, a robot learning to
navigate a maze.
Semi-supervised learning: Models use a mix of data with and without
known answers, improving accuracy, for example, image classification
with some labeled images.

xxxi
Terminology

A maquette is a small-scale model or sculpture that serves as a


preliminary design or blueprint for a larger, more finished work. Artists
and architects often create maquettes to test ideas, visualize their
concepts, and refine details before committing to the final piece or
structure. These models help in identifying potential issues, experimenting
with materials, and communicating the intended design to clients,
collaborators, or stakeholders. Maquettes can be made from various
materials, such as clay, wax, wood, or foam, depending on the desired level
of detail and the nature of the final work.
Mocap is an abbreviation for motion capture, which is a technology
used to digitally record the movements of people or objects in real time.
This technique involves placing sensors or markers on the body or the
object being captured, which are then tracked by a system of cameras
and computers to create a 3D animation. Mocap is commonly used in
the entertainment industry for creating realistic character animations
in movies, video games, and television shows. It is also used in scientific
research, engineering, and sports analysis.
Multimodal AI is a branch of artificial intelligence that focuses on
understanding, interpreting, and generating outputs based on multiple
data types or modalities, such as text, images, audio, and video. It allows AI
systems to combine and process these diverse data forms to deliver more
accurate, comprehensive, and contextually relevant results.
Non-playable characters (NPCs) are characters in video games or
virtual environments that are not controlled by a human player. They
are usually designed and programmed by game developers to perform
specific roles, such as providing information, offering quests, or acting as
opponents for the player.
A neural network is a type of machine learning model inspired by
the human brain. It consists of interconnected layers of nodes or neurons
that process and transmit information. Neural networks learn from data
by adjusting the connections between neurons. They are commonly used

xxxii
Terminology

for tasks like image recognition, language understanding, and decision


making, for example, identifying objects in photos.
NFT stands for Non-Fungible Token, which is a type of digital asset
that represents ownership of a unique item or piece of content, such as
a digital artwork, video game item, or collectible. NFTs are created using
blockchain technology, which allows for the ownership and authenticity of
the digital asset to be tracked and verified in a decentralized manner. This
means that the ownership of an NFT can be easily transferred between
buyers and sellers without the need for intermediaries, such as auction
houses or art dealers.
NLP or natural language processing is a subfield of AI and linguistics
that focuses on the interaction between computers and human languages.
It involves the development of algorithms and models that enable
computers to understand, interpret, and generate human language in a
way that is both meaningful and useful.
Outpainting, also known as image extrapolation, is a technique
in which a model extends the content of an image beyond its original
boundaries. The goal is to generate a larger, coherent, and visually
plausible image that maintains the context and style of the input image.
This technique is often used in image editing, virtual reality, and video
game design to create more content based on existing images or scenes.
Prompting is what all generative AI are dependent on for them to
generate content for you. Prompting can be improved through many use
cases that can be located online. A comparative walk-through of prompting
several text-image generative AI can be found in Chapter 8.
Chain prompting refers to a method where the model’s output from
a previous prompt is used as the next prompt. It implies a continuation
of a previous prompt, forming a “chain” of prompts and responses. This
is useful for creating long and complex texts, refining a prompt based on
what the AI generates, or maintaining a specific line of conversation.
Prototyping is the process of creating a preliminary or initial version
of a product, service, or system in order to test and evaluate its design and

xxxiii
Terminology

functionality. Prototyping can be done in various forms, such as sketches,


3D models, mock-ups, or interactive digital prototypes. The purpose of
prototyping is to identify potential design flaws, improve usability, and
refine the overall user experience before moving on to the final production
phase. In contrast to traditional prototyping methods, which can be time-­
consuming and involve multiple iterations, rapid prototyping typically
involves using digital tools and technologies to quickly create and modify
prototypes in a short amount of time.
Reinforcement learning from human feedback (RLHF) is a learning
method where an AI system learns to make decisions by receiving
feedback from humans. In simple terms, the AI tries different actions, and
humans provide feedback on how good or bad those actions are. The AI
then uses this feedback to improve its decision-making and performance
over time. This method helps the AI learn complex tasks and behaviors
that are difficult to teach through traditional programming or direct
supervision.
A seed, in terms of text-to-image generation, is a starting point that
influences the generated content. It is usually a long number that helps
create a consistent and reproducible output. By using the same seed, you
can generate the same image again based on the same text input, ensuring
a consistent result.
Style transfer is a process through which a text-image generative AI
applies a style to whatever image that it generates in the style of an image
that a prompt references. There are different methods through which
different AI achieve this. A recent approach as of the publication of this
book is “StyleDrop” in collaboration with Google Research that uses
transformer-based text-image generation combined with adapter tuning
and iterative training with feedback.
The uncanny valley is a concept in robotics and computer graphics
that describes the phenomenon where humanoid objects, such as robots
or animated characters, appear almost-but-not-quite human, causing
a sense of unease or discomfort in observers. As the level of realism in

xxxiv
Terminology

the human-like appearance or behavior of these objects increases, the


emotional response of the observer shifts from positive to negative,
creating a “valley” or depression in the emotional response curve. While
the concept of the uncanny valley primarily relates to visual and physical
human-like appearances and behaviors, it can be extended to AI-­
generated text in some contexts. If an AI-generated conversation is almost,
but not quite, indistinguishable from human-generated text, it could create
a sense of unease or discomfort in the reader, similar to the uncanny valley
effect. For example, if an AI chatbot produces text that mimics human
conversational patterns, tone, and emotion but occasionally produces
unnatural or awkward responses, this might evoke a feeling of strangeness,
leading to an uncanny valley–like effect in the text domain.
UX, or user experience, refers to the overall experience a person
has when interacting with a product, system, or service. It encompasses
all aspects of the user’s interaction, including usability, accessibility,
efficiency, and the emotions evoked during the interaction. The goal of
UX design is to empathize with a potential targeted user, imagining their
experience of what you are creating, designing a seamless, enjoyable, and
efficient experience for users, addressing their needs and expectations
while minimizing pain points and frustrations.
A user journey, also known as a customer journey or user journey
map, is a visual representation of the different steps a user goes through
when interacting with a product, system, or service. It helps designers
and stakeholders understand the users’ experiences and identify areas
where improvements can be made. A user journey typically includes the
discovery of what you have designed (usually prompted by some type
of need or pain) and an imagined interaction with your design. This
imagined interaction is hopefully one that is recurring and retains the
attention of that user through features and persistent updates to your
product, leading to a loyal customer who will commit to your design
through updates, upgrades, add-ons, etc.

xxxv
Terminology

Variational Autoencoder (VAE) is a machine learning model that


compresses data and then recreates it. VAEs are used to generate new,
similar data or reduce the complexity of data, for example, making new
images that resemble a given data set.
Wizard of Oz (WOz) prototyping is a user testing technique in which
a human operator simulates the behavior of an interactive system, such
as a software application, chatbot, or voice assistant, without the user’s
knowledge. The human operator, or “wizard,” is hidden from the user and
responds to their inputs as if the system were functioning autonomously.
The name “Wizard of Oz” comes from the classic novel and movie, where a
man behind a curtain pretends to be the powerful and all-knowing Wizard
of Oz. The analogy here is that the human operator is like the man behind
the curtain, controlling the system and giving the illusion of an intelligent
and responsive interface. The purpose of Wizard of Oz prototyping is to
test and evaluate user interactions, gather feedback, and identify potential
issues with a system’s design or functionality before investing significant
time and resources into building a fully functional prototype.

xxxvi
Introduction
For centuries artists have been using new technologies to support their
creative expressions. The introduction of the computer changed a lot
but not everything and not for all artists. Technique, skill, and craft
were and still are needed. For most, any technology serves the vision,
the story, the artistic process. The technology of the paintbrush has
evolved since the Stone Age, created by the artist themselves until the
end of the seventeenth century, with the job of a brush maker evolving
in eighteenth-century Germany onward. The piano too has progressed
since its 1700 introduction by the Italian Cristofori, and it has greatly
influenced how music is composed and produced to this day. Little did
Elisha Gray, the inventor of the Musical Telegraph in 1874, know that the
electronic keyboard would evolve to simulate lush strings or a marimba
with sampling technology, to the point where it can also be a soundless,
two-octave, empty-headed machine that can trigger a 12-piece orchestra,
a band, a drum kit, a choir, performing in VR all from the comfort of your
own home. Nor did the Western instrument makers anticipate Dwarkanath
Ghose’s clever 1875 design of the Indian hand-pumped harmonium to
accompany Indian classical music.
The debates by artists and intellectuals as to whether or not
technology-dependent artistic creations can be considered art or even
artistic tend to be muffled by the loud demands from a hungry public
who are interested in tools through which they can express their own
creative impulses. Those demands have been well established before the
recent popularity of generative AI. Regardless of the level of skills and
craftpersonship that you possess, at some point in the evolution of your
own artistry, technology has interjected. Technological advances continue

xxxvii
Introduction

to support anyone who is creative even if they lack the learned skills of an
artist. Generative AI is one such advance, and while some may use it to
demonstrate that advanced software can now replace a creative human, its
emerging value is as another creative tool that can support, augment, and,
in the hands of innovative creators, spawn new human expressions.
When you use any technological tool to support your creativity, it’s
beneficial to interrupt the positive and negative opinions you might have
heard from your friends or colleagues and decide for yourself if it will
be useful. That requires a bit of research, so you know how best to use it
and when, how it works, the risks of using it, the risks of not using it, the
costs, and the rewards. The content in this book illustrates how you can
use generative AI to support your creativity, points to the pros and cons of
doing so, and shows that AI is another useful tool in the history of useful
human-made technologies.
A repeating theme is that the content an AI generates is most useful
when regarded as a work in progress, a prototype that can be sculpted
and refined with the technique and skill that you bring to it. Consider the
content to be more than a workbook and more than a critical repositioning
of the technology of narrow AI and its role in supporting the persistent
human habit of being creative. Interacting with an AI is a collision of
two opposing forces, our need to develop the skill required to create that
often clashes with leveraging intelligent machines that can automate that
process. The book, as a mashup of technique, application of skills, ideas,
experience, processes, and use cases combined with reflective criticism,
emulates artistic processes that creative persons will resonate with. Each
chapter of the book has been designed to provide you more equipment for
you to continue setting up your own prototyping lab or whatever you call
the creative work that you iterate on. As more generative AI come to the
surface, all readers will benefit from developing a deeper understanding
as to how it might support creativity in addition to developing a critical
vocabulary as to its pros and cons.

xxxviii
Introduction

Whom the Book Is For?


Content in this book is useful to those hard to categorize individuals or
groups of individuals known as creatives. Using the adjective “creative” as
a noun to describe different types of creators might be cringeworthy or a
semantic no-no, but the term has already been in use for decades gracing
the cover of several books since 2011. In this book, the term encompasses
digital artists and artists from across disciplines who are already content
creators in their own right and are not dependent on generative AI to
create. The term also extends to those humans with jobs that are not
usually associated with creativity. Individuals from any discipline or
craft who may not necessarily categorize themselves as creatives or even
creators show ample amounts of creativity. Generative AI can support
those who regularly create and those who may not necessarily have
had training in a specific craft or discipline. Creatives are people who
possess the ability to generate innovative ideas, concepts, and solutions
regardless of their field. They excel in using their imagination, originality,
and artistic or technical skills to produce works that are a unique offering
to the world. Creatives thrive in environments where they can express
their ideas, experiment with different mediums, and push the boundaries
of conventional thinking. You can identify a creative by their incessant
curiosity to explore new things, by their receptivity to new ideas and
perspectives often embracing the unconventional, by their resilience and
determination in the face of design challenges, in how they adjust their
ideas and become adaptable based on the feedback they receive, through
their affirmation of diverse perspectives when collaborating with others,
and through their ability to communicate ideas and concepts to different
types of audiences.
While those artists who have spent much of their lives training to
develop their craft may naturally possess certain creative characteristics,
these qualities can also be learned and developed through practice,
education, and exposure to various types of projects and collaborative

xxxix
Introduction

experiences. Think of a trained classical pianist who has excelled at playing


anything from Bach to Liszt with ease. While they have spent the greater
part of their lives learning to master the technique necessary to read and
play challenging piano music, they may not necessarily be able to transfer
those skills to other creative acts. They may not even consider themselves
to be creative. Conversely, a UX designer who may not have the skills of a
pianist or the training of a 2D artist can still be considered creative when
they contribute to improving the user experience of a mobile application.
The techniques and approaches demonstrated in this book aim to enhance
the way you already create if you are used to being a creator, and if you
regularly research, explore, create, and iterate on anything that eventually
makes itself to your targeted user. Whether you are a painter, sculptor,
composer, storyteller, interaction designer, illustrator, game designer,
sound designer, playwright, tattoo artist, coder, user interface designer,
dancer, theater maker, design thinker, business strategist, NFT creator,
or graphic designer, the methods, use cases, perspectives, and insights
presented in this book will stretch the boundaries of your creativity.

Re(introducing) Prototyping
The content that an AI generates is a catalyst for prototyping. The term
“prototyping” is used throughout the book. While prototyping may seem
more associated with engineers and software developers compared with
those accustomed to a different way of expressing their idea through
traditional artmaking practices, at its core, it is a method for rapidly
generating, testing, and iterating upon ideas. Creatives of all disciplines
have long been engaged in this process—exploring various techniques,
materials, and styles before arriving at the final creation they share for
public consumption. You can experiment with and adapt generative AI if
they fit your own creative process. By trying them out, you test them. By
leveraging generative AI for prototyping, creatives can challenge their own

xl
Introduction

patterned methods of creation, iterate upon their ideas, refine their vision,
and potentially produce something better than what they might have
imagined.
For those more versed in technology development, the concept of
rapid prototyping may already be familiar. However, the integration
of generative AI into your creative process offers new and exciting
possibilities. As you read this book, you will discover how generative
AI can enhance your existing skills and accelerate already familiar
workflows. You will also learn to harness the power of generative AI to
develop novel creative styles and break through blocks and barriers with
prototypes that may take your final work into completely new directions.
Throughout the book, you will find a wealth of examples, case studies,
activities, and takeaways that illustrate the potential of generative AI for
prototyping beginning ideas. These practical resources will help you
build your understanding of the technology and inspire you to integrate
it into your own creative process. Tools and approaches to getting the
most out of generative AI may prove invaluable as you embark to build an
experimental lab with responsive technology.

Informed Choice(s)
To gauge whether you should integrate generative AI into your own
creative process, it is also important to critically discern and understand
the implications of doing so. Amid the debates reverberating around us
on the uses and misuses of AI, one thing is certain: we all need to better
understand how it works before deciding if we’re going to use it. Many
of the warnings and cautionary tales about interacting with artificial
intelligence tend to lump AI into one category, as if narrow AI and general
AI were synonymous, or that all narrow AI machine learning models
are the same. Thus, deepening your understanding of all the creative
potentials that generative AI offers, how it can support your creative

xli
Introduction

activities, and how it can also be used to harm, misrepresent, normalize,


exclude, control, misguide, track, steal, and oppress humans is an essential
part of the process. This is the case for any creative using any technology to
express themselves and share their work in the world (Figure 3).

Figure 3. Frankenstein’s AI lab as the earth itself. Iteration #56 of the


author’s hands holding a globe of the earth

xlii
Introduction

As you sway back and forth between your choice to integrate it in


your creative workflows and not to use it at all, a similar rule of engaging
with any generative AI system applies with all technology; you need to
understand how it might benefit your own creative journey before making
the judgement call on whether you use it. You also need to understand
some of the known consequences of its use and how interacting with
specific narrow AI will impact other humans.
Do you need to use generative AI to prototype? Not at all. In fact, if
you are already a content creator, you may already be satisfied with your
own well-defined iterative and creative process. Can generative AI open
you to more possibilities as a creative tool that also complements your
own practice? Yes. Do you need to adopt a critical view toward the use of
generative AI as you embrace the technology? Absolutely.

Taking Advantage of Generative AI As a Tool


To illustrate the options we have in engaging with generative AI, I
recount the following story that, in part, inspired this book. “Look what
I generated,” I overheard while prompting a barista for a latte at a coffee
shop. “That’s amazing. You should post that.” “I already did.” While the
exchange was short and may have sounded like a completely different
language to some, a few insights came to me. The person was using the
text-image generative AI called Midjourney, and the only way to access
Midjourney is using the communication and instant messaging social
platform Discord. The scene is worth retelling in that it reminded me that
when we interact with any generative AI, as creatives we are faced with
several choices:

• We do nothing with the content we prompt an AI to


generate, except maybe store it in our photo library for
later use if it is created as an isolated and solitary act of
creation out of curiosity or just to pass the time.

xliii
Introduction

• We take what is generated and immediately share it


publicly through whatever medium or social portal
we fancy to show that we are hip and in touch with the
latest trends. In the case of doing so on Discord, our
motivation might be to immediately generate it on a
social thread to receive likes, comments, or affirmation
of our prowess to use the written word to generate
something that often looks incredible.

• We critically analyze the generated content and


choose to try again to see what other marvels the AI
can generate, pay money to have more credits and
access more features, or keep sourcing new and free
generative AI. We can then be inspired to create our
own prototypes from content that is generated or delete
it and start over. Or we can choose not to use generative
AI ever again since we don’t need it as part of our own
process of creation.

When we generate content beyond just having fun with a machine,


creatives might think of how they can use whatever content is generated
for purposes beyond the immediate result. For example, a generated image
can be used to accompany a blog post that you might write, an assignment
you might be working on, a critical deconstruction that highlights inherent
biases in an LLM’s generated content, etc. That process involves a critical
analysis of the content and how it might be used with whatever media we
are creating (Figure 4). For creatives, the last point in the preceding list is
a priority to understand how generated content can be recontextualized,
integrated, modified, regenerated, or used as part of a larger idea or vision
you are creating. What that implies is that you need to actively integrate
generative AI content to support your own creative process rather than
seeing the content it first generates as a final product that needs to be
shared immediately for likes, profit, or showmanship.

xliv
Introduction

Figure 4. An AI bot prompted to “look at yourself reflectively” and


accompanied with a photo of the author looking in a small hand
mirror. Iterations = 45

The Wizard of Oz Prototype


Much to the disappointment of Terminator fans everywhere, it is not the
machine learning model that is capable of developing, nurturing, adapting,
and acting creatively from its pre-programmed intelligence and training
on its own. Better than representing AI as a cyborg intent on destroying

xlv
Introduction

humanity, generative AI systems are more akin to another movie character:


the Wizard of Oz. The reference to the wizard behind the curtain pulling
the strings is the origin of the term “Wizard of Oz” prototype. Generative
AI can be considered a Wizard of Oz (WOz) prototype in that the wizards
behind the scenes make it look like AI possesses creativity, personality,
and intelligence, when in reality it runs simulations based on how it has
been pre-trained on other people’s content. The idea of a WOz originates in
the book The Wonderful Wizard of Oz by L. Frank Baum in addition to the
movie of the same name. The story features a wizard character that appears
to be able to conjure whatever a person wishes. The truth, however, as
revealed by protagonist Dorothy and the friends she makes in Emerald City
is that the wizard is a simple man who is an inventor with no magical skills.
AI’s Wizards of Oz, however, are a collective consisting of analysts who
collect the source images and documents, labelers and annotators who
classify that data, and machine learning engineers who write the code to
interpret that data, train the data, and test that data. The magical intelligent
creature called AI also seems to do it all by itself, but it is trained on a large
amount of data, and behind the scenes, invisible, are those workers who
exert a powerful influence on what we eventually experience as a creative
and intelligent machine. AI cannot learn from specific interactions with
individual users in the way that humans understand what learning is. This
can be explained in more detail by a chatbot itself.
LLM: A chatbot can learn from interactions with individual users
by storing and analyzing the inputs (messages) it receives from the users
and the outputs (responses) it gives. By analyzing a large amount of these
interactions, it can learn the best responses to give in different situations. This
is done not by “understanding” the content of the interactions in the human
sense, but rather by identifying patterns in the data. AI can only learn from
specific user interactions if it is designed to do so and if those interactions
are stored and used as part of the training data. Some AI systems, especially
those used in sensitive areas like healthcare or finance, are specifically

xlvi
Another random document with
no related content on Scribd:
1.C. The Project Gutenberg Literary Archive Foundation (“the
Foundation” or PGLAF), owns a compilation copyright in the
collection of Project Gutenberg™ electronic works. Nearly all the
individual works in the collection are in the public domain in the
United States. If an individual work is unprotected by copyright law in
the United States and you are located in the United States, we do
not claim a right to prevent you from copying, distributing,
performing, displaying or creating derivative works based on the
work as long as all references to Project Gutenberg are removed. Of
course, we hope that you will support the Project Gutenberg™
mission of promoting free access to electronic works by freely
sharing Project Gutenberg™ works in compliance with the terms of
this agreement for keeping the Project Gutenberg™ name
associated with the work. You can easily comply with the terms of
this agreement by keeping this work in the same format with its
attached full Project Gutenberg™ License when you share it without
charge with others.

1.D. The copyright laws of the place where you are located also
govern what you can do with this work. Copyright laws in most
countries are in a constant state of change. If you are outside the
United States, check the laws of your country in addition to the terms
of this agreement before downloading, copying, displaying,
performing, distributing or creating derivative works based on this
work or any other Project Gutenberg™ work. The Foundation makes
no representations concerning the copyright status of any work in
any country other than the United States.

1.E. Unless you have removed all references to Project Gutenberg:

1.E.1. The following sentence, with active links to, or other


immediate access to, the full Project Gutenberg™ License must
appear prominently whenever any copy of a Project Gutenberg™
work (any work on which the phrase “Project Gutenberg” appears, or
with which the phrase “Project Gutenberg” is associated) is
accessed, displayed, performed, viewed, copied or distributed:
This eBook is for the use of anyone anywhere in the United
States and most other parts of the world at no cost and with
almost no restrictions whatsoever. You may copy it, give it away
or re-use it under the terms of the Project Gutenberg License
included with this eBook or online at www.gutenberg.org. If you
are not located in the United States, you will have to check the
laws of the country where you are located before using this
eBook.

1.E.2. If an individual Project Gutenberg™ electronic work is derived


from texts not protected by U.S. copyright law (does not contain a
notice indicating that it is posted with permission of the copyright
holder), the work can be copied and distributed to anyone in the
United States without paying any fees or charges. If you are
redistributing or providing access to a work with the phrase “Project
Gutenberg” associated with or appearing on the work, you must
comply either with the requirements of paragraphs 1.E.1 through
1.E.7 or obtain permission for the use of the work and the Project
Gutenberg™ trademark as set forth in paragraphs 1.E.8 or 1.E.9.

1.E.3. If an individual Project Gutenberg™ electronic work is posted


with the permission of the copyright holder, your use and distribution
must comply with both paragraphs 1.E.1 through 1.E.7 and any
additional terms imposed by the copyright holder. Additional terms
will be linked to the Project Gutenberg™ License for all works posted
with the permission of the copyright holder found at the beginning of
this work.

1.E.4. Do not unlink or detach or remove the full Project


Gutenberg™ License terms from this work, or any files containing a
part of this work or any other work associated with Project
Gutenberg™.

1.E.5. Do not copy, display, perform, distribute or redistribute this


electronic work, or any part of this electronic work, without
prominently displaying the sentence set forth in paragraph 1.E.1 with
active links or immediate access to the full terms of the Project
Gutenberg™ License.
1.E.6. You may convert to and distribute this work in any binary,
compressed, marked up, nonproprietary or proprietary form,
including any word processing or hypertext form. However, if you
provide access to or distribute copies of a Project Gutenberg™ work
in a format other than “Plain Vanilla ASCII” or other format used in
the official version posted on the official Project Gutenberg™ website
(www.gutenberg.org), you must, at no additional cost, fee or expense
to the user, provide a copy, a means of exporting a copy, or a means
of obtaining a copy upon request, of the work in its original “Plain
Vanilla ASCII” or other form. Any alternate format must include the
full Project Gutenberg™ License as specified in paragraph 1.E.1.

1.E.7. Do not charge a fee for access to, viewing, displaying,


performing, copying or distributing any Project Gutenberg™ works
unless you comply with paragraph 1.E.8 or 1.E.9.

1.E.8. You may charge a reasonable fee for copies of or providing


access to or distributing Project Gutenberg™ electronic works
provided that:

• You pay a royalty fee of 20% of the gross profits you derive from
the use of Project Gutenberg™ works calculated using the
method you already use to calculate your applicable taxes. The
fee is owed to the owner of the Project Gutenberg™ trademark,
but he has agreed to donate royalties under this paragraph to
the Project Gutenberg Literary Archive Foundation. Royalty
payments must be paid within 60 days following each date on
which you prepare (or are legally required to prepare) your
periodic tax returns. Royalty payments should be clearly marked
as such and sent to the Project Gutenberg Literary Archive
Foundation at the address specified in Section 4, “Information
about donations to the Project Gutenberg Literary Archive
Foundation.”

• You provide a full refund of any money paid by a user who


notifies you in writing (or by e-mail) within 30 days of receipt that
s/he does not agree to the terms of the full Project Gutenberg™
License. You must require such a user to return or destroy all
copies of the works possessed in a physical medium and
discontinue all use of and all access to other copies of Project
Gutenberg™ works.

• You provide, in accordance with paragraph 1.F.3, a full refund of


any money paid for a work or a replacement copy, if a defect in
the electronic work is discovered and reported to you within 90
days of receipt of the work.

• You comply with all other terms of this agreement for free
distribution of Project Gutenberg™ works.

1.E.9. If you wish to charge a fee or distribute a Project Gutenberg™


electronic work or group of works on different terms than are set
forth in this agreement, you must obtain permission in writing from
the Project Gutenberg Literary Archive Foundation, the manager of
the Project Gutenberg™ trademark. Contact the Foundation as set
forth in Section 3 below.

1.F.

1.F.1. Project Gutenberg volunteers and employees expend


considerable effort to identify, do copyright research on, transcribe
and proofread works not protected by U.S. copyright law in creating
the Project Gutenberg™ collection. Despite these efforts, Project
Gutenberg™ electronic works, and the medium on which they may
be stored, may contain “Defects,” such as, but not limited to,
incomplete, inaccurate or corrupt data, transcription errors, a
copyright or other intellectual property infringement, a defective or
damaged disk or other medium, a computer virus, or computer
codes that damage or cannot be read by your equipment.

1.F.2. LIMITED WARRANTY, DISCLAIMER OF DAMAGES - Except


for the “Right of Replacement or Refund” described in paragraph
1.F.3, the Project Gutenberg Literary Archive Foundation, the owner
of the Project Gutenberg™ trademark, and any other party
distributing a Project Gutenberg™ electronic work under this
agreement, disclaim all liability to you for damages, costs and
expenses, including legal fees. YOU AGREE THAT YOU HAVE NO
REMEDIES FOR NEGLIGENCE, STRICT LIABILITY, BREACH OF
WARRANTY OR BREACH OF CONTRACT EXCEPT THOSE
PROVIDED IN PARAGRAPH 1.F.3. YOU AGREE THAT THE
FOUNDATION, THE TRADEMARK OWNER, AND ANY
DISTRIBUTOR UNDER THIS AGREEMENT WILL NOT BE LIABLE
TO YOU FOR ACTUAL, DIRECT, INDIRECT, CONSEQUENTIAL,
PUNITIVE OR INCIDENTAL DAMAGES EVEN IF YOU GIVE
NOTICE OF THE POSSIBILITY OF SUCH DAMAGE.

1.F.3. LIMITED RIGHT OF REPLACEMENT OR REFUND - If you


discover a defect in this electronic work within 90 days of receiving it,
you can receive a refund of the money (if any) you paid for it by
sending a written explanation to the person you received the work
from. If you received the work on a physical medium, you must
return the medium with your written explanation. The person or entity
that provided you with the defective work may elect to provide a
replacement copy in lieu of a refund. If you received the work
electronically, the person or entity providing it to you may choose to
give you a second opportunity to receive the work electronically in
lieu of a refund. If the second copy is also defective, you may
demand a refund in writing without further opportunities to fix the
problem.

1.F.4. Except for the limited right of replacement or refund set forth in
paragraph 1.F.3, this work is provided to you ‘AS-IS’, WITH NO
OTHER WARRANTIES OF ANY KIND, EXPRESS OR IMPLIED,
INCLUDING BUT NOT LIMITED TO WARRANTIES OF
MERCHANTABILITY OR FITNESS FOR ANY PURPOSE.

1.F.5. Some states do not allow disclaimers of certain implied


warranties or the exclusion or limitation of certain types of damages.
If any disclaimer or limitation set forth in this agreement violates the
law of the state applicable to this agreement, the agreement shall be
interpreted to make the maximum disclaimer or limitation permitted
by the applicable state law. The invalidity or unenforceability of any
provision of this agreement shall not void the remaining provisions.
1.F.6. INDEMNITY - You agree to indemnify and hold the
Foundation, the trademark owner, any agent or employee of the
Foundation, anyone providing copies of Project Gutenberg™
electronic works in accordance with this agreement, and any
volunteers associated with the production, promotion and distribution
of Project Gutenberg™ electronic works, harmless from all liability,
costs and expenses, including legal fees, that arise directly or
indirectly from any of the following which you do or cause to occur:
(a) distribution of this or any Project Gutenberg™ work, (b)
alteration, modification, or additions or deletions to any Project
Gutenberg™ work, and (c) any Defect you cause.

Section 2. Information about the Mission of


Project Gutenberg™
Project Gutenberg™ is synonymous with the free distribution of
electronic works in formats readable by the widest variety of
computers including obsolete, old, middle-aged and new computers.
It exists because of the efforts of hundreds of volunteers and
donations from people in all walks of life.

Volunteers and financial support to provide volunteers with the


assistance they need are critical to reaching Project Gutenberg™’s
goals and ensuring that the Project Gutenberg™ collection will
remain freely available for generations to come. In 2001, the Project
Gutenberg Literary Archive Foundation was created to provide a
secure and permanent future for Project Gutenberg™ and future
generations. To learn more about the Project Gutenberg Literary
Archive Foundation and how your efforts and donations can help,
see Sections 3 and 4 and the Foundation information page at
www.gutenberg.org.

Section 3. Information about the Project


Gutenberg Literary Archive Foundation
The Project Gutenberg Literary Archive Foundation is a non-profit
501(c)(3) educational corporation organized under the laws of the
state of Mississippi and granted tax exempt status by the Internal
Revenue Service. The Foundation’s EIN or federal tax identification
number is 64-6221541. Contributions to the Project Gutenberg
Literary Archive Foundation are tax deductible to the full extent
permitted by U.S. federal laws and your state’s laws.

The Foundation’s business office is located at 809 North 1500 West,


Salt Lake City, UT 84116, (801) 596-1887. Email contact links and up
to date contact information can be found at the Foundation’s website
and official page at www.gutenberg.org/contact

Section 4. Information about Donations to


the Project Gutenberg Literary Archive
Foundation
Project Gutenberg™ depends upon and cannot survive without
widespread public support and donations to carry out its mission of
increasing the number of public domain and licensed works that can
be freely distributed in machine-readable form accessible by the
widest array of equipment including outdated equipment. Many small
donations ($1 to $5,000) are particularly important to maintaining tax
exempt status with the IRS.

The Foundation is committed to complying with the laws regulating


charities and charitable donations in all 50 states of the United
States. Compliance requirements are not uniform and it takes a
considerable effort, much paperwork and many fees to meet and
keep up with these requirements. We do not solicit donations in
locations where we have not received written confirmation of
compliance. To SEND DONATIONS or determine the status of
compliance for any particular state visit www.gutenberg.org/donate.

While we cannot and do not solicit contributions from states where


we have not met the solicitation requirements, we know of no
prohibition against accepting unsolicited donations from donors in
such states who approach us with offers to donate.

International donations are gratefully accepted, but we cannot make


any statements concerning tax treatment of donations received from
outside the United States. U.S. laws alone swamp our small staff.

Please check the Project Gutenberg web pages for current donation
methods and addresses. Donations are accepted in a number of
other ways including checks, online payments and credit card
donations. To donate, please visit: www.gutenberg.org/donate.

Section 5. General Information About Project


Gutenberg™ electronic works
Professor Michael S. Hart was the originator of the Project
Gutenberg™ concept of a library of electronic works that could be
freely shared with anyone. For forty years, he produced and
distributed Project Gutenberg™ eBooks with only a loose network of
volunteer support.

Project Gutenberg™ eBooks are often created from several printed


editions, all of which are confirmed as not protected by copyright in
the U.S. unless a copyright notice is included. Thus, we do not
necessarily keep eBooks in compliance with any particular paper
edition.

Most people start at our website which has the main PG search
facility: www.gutenberg.org.

This website includes information about Project Gutenberg™,


including how to make donations to the Project Gutenberg Literary
Archive Foundation, how to help produce our new eBooks, and how
to subscribe to our email newsletter to hear about new eBooks.

You might also like