HBM146: Theodora

Computer generated text projected on a computer generated waves. Image by Jeff Emtman.

 

How does a computer learn to speak with emotion and conviction? 

Language is hard to express as a set of firm rules.  Every language rule seems to have exceptions and the exceptions have exceptions etcetera.  Typical, “if this then that” approaches to language just don’t work.  There’s too much nuance. 

But each generation of algorithms gets closer and closer. Markov chains were invented in the 1800’s and rely on nothing more than basic probabilities.  It’s a simple idea, just look at an input (like a book), and learn the order in which words tend to appear.  With this knowledge, it’s possible to generate new text in the same style of the input, just by looking up the probability of words that are likely to follow each other.  It’s simple and sometimes half decent, but not effective for longer outputs as this approach tends to lack object permanence and generate run-on sentences. Markov models are  used today in predictive text phone keyboards, but can also be used to predict weather, stock prices, etc. 

There’ve been plenty of other approaches to language generation (and plenty of mishaps as well).  A notable example is CleverBot, which chats with humans and heavily references its previous conversations to generate its results.  Cleverbot’s chatting can sometimes be eerily human, perfectly regurgitating slang, internet abbreviations, obscure jokes.  But it’s kind of a sly trick at the end of the day, and, as with Markov chains, Cleverbot’s AI still doesn’t always grasp grammar and object permanence. 

In the last decade or two, there’s been an explosion in the abilities of a different kind of AI, the Artificial Neural Network.  These “neural nets” are modelled off the way that brains work, running stimuli through their “neurons” and reinforcing paths that yield the best results. 

The outputs are chaotic until they are properly “trained.” But as the training reaches its optimal point, a model emerges that can efficiently process incoming data and spit out output that incorporates the same kinds of nuance, strangeness, and imperfection that you expect to see in the natural world.  Like Markov chains, neural nets have a lot of applications outside language too. 

But these neural networks are complicated, like a brain.  So complicated, in fact, that few try to dissect these trained models to see how they’re actually working.  And tracing it backwards is difficult, but not impossible

If we temporarily ignore the real risk that sophisticated AI language models pose for societies attempting to separate truth from fiction these neural net models allow for some interesting possibilities, namely extracting the language style of a large body of text and using that extracted style to generate new text that’s written in the voice of the original text. 

In this episode, Jeff creates an AI and names it “Theodora.”  She’s trained to speak like a presenter giving a Ted Talk.  The result varies from believable to utter absurdity and causes Jeff to reflect on the continued inability of individuals, AI, and large nonprofits to distinguish between good ideas and absolute madness

 

Three bits of raw output from Theodora. These were text files were sent to Google Cloud’s TTS service for voicing.

 

On the creation of Theodora:  Jeff used a variety of free tools to generate Theodora in the episode.  OpenAI’s Generative Pre-trained Transformer 2 (GPT-2) was turned into the Python library GPT2 Simple by Max Woolf, who also created a tutorial demonstrating how to train the model for free using Google Colab.  Jeff used this tutorial to train Theodora on a corpus of about 900 Ted Talk transcripts for 5,000 training steps. Jeff then downloaded the model locally and used JupyterLab (Python) to generate new text.  That text was then sent to Google Cloud’s Text-To-Speech (TTS) service where it was converted to the voice heard on the episode. 

Producer: Jeff Emtman
Music: Liance

 
 

James Li aka. “Liance.” Photo by Alex Kozobolis

This Painting Doesn't Dry album art (4000 x 4000).jpg

Sponsor: Liance

Independent musician James Li has just released This Painting Doesn’t Dry, an album about the relationship between personal experiences and the story of humanity as a whole.

James made this album while he anxiously watched his homeland of Hong Kong fall into political crisis.

HBM145: The Juice Library

Amanda Petrus dressed as Brunhilde among a field of fruit punch.  Image by Jeff Emtman.

Amanda Petrus dressed as Brunhilde among a field of fruit punch. Image by Jeff Emtman.

 

Like so many others, Amanda Petrus got a bit lost after college. She had a chemistry degree and not a lot of direction.  But she was able to find work at a juice factory in the vineyards of western New York.  Her job was quality control, which meant overnight shifts at the factory, tasting endless cups of fruit punch and comparing them to the ever-evolving set of juice standards that they kept in the “juice library.” 

She calls herself and “odd creature”, especially for the time and place: she was a woman working in a factory dominated by men, she was openly lesbian (and yet still rebuffing advances from her coworkers), and she was a lover of Richard Wagner’s—sometimes dressing up as a Valkyrie.

Unfortunately, much of her time at the factory was characterized by the antics of her juice tasting colleague, Tim, who, in some ways, mirrored the traits of her favorite composer.  He was incredibly gifted at understanding the flavor profile of fruit punch, able to predict the exact ratios of passion fruit, high fructose corn syrup, and red 40 needed to please the factory’s  clients.  But he also shared Wagner’s xenophobia and misogyny, with his own brand of paranoia, too.  Often, Amanda was a target of his outbursts

This came to a head when Amanda was suddenly fired and escorted from the factory after Tim levelled an incredible accusation of conspiracy against her. 

After this incident, Amanda got into grad school, and started her path towards teaching.  She is now a professor of chemistry at the Community College of Rhode Island.  She also runs the website Mail From A Cat where you can order mail...from a cat. 

Producer: Jeff Emtman
Music: The Black Spot, Serocell, Ride of the Valkyries (performed by The United States Marine Band),Overture from The Flying Dutchman (performed by University of Chicago Symphony Orchestra), Prelude from Parsifal (recording from the European Archive). 

 
Image courtesy Amanda Petrus.

Image courtesy Amanda Petrus.

 
Esoteric Bumper Stickers.jpg

Esoteric Bumper Stickers sells waterproof vinyl stickers can fit any feeling. Not just for cars, Esoteric Bumper Stickers can show the world your knowledge of the briny deep, your passion for flora, your love of claws in the dark, etc.

Just added $1000 to the resale value of this car.

HBM144: Keeping A Place

Image by Jeff Emtman. Blended photographs taken from the Ballard Bridge and Wayne Tunnel in Bothell, Washington (featuring murals by Kristen Ramirez)

 

HBM Host Jeff Emtman has always been afraid of losing his memories. Places he cares about keep getting torn down.

Forrest Perrine prepares a balloon at the Green Lake Aqua Theater.

In this episode, Jeff bikes around Seattle recording the sounds of a popping balloon to capture the sound of places he likes: Padelford Hall’s Parking Garage, The Wayne Tunnel in Bothell, his old house in Roosevelt, The Greenlake Aqua Theater, and his front porch on a snowy day.  

The sound of a popping balloon can be used to re-create a space digitally.  These popping sounds are loud ‘impulses’, and the space ‘responds’ accordingly.  These impulse responses can then be fed to an audio effect called a “convolution reverb” which interprets the impulse response and applies it to any incoming sound.  

Rick and Kathy Emtman are heard on this episode.  Forrest Perrine helped with some of the recordings.  

Producer: Jeff Emtman
Music: The Black Spot, August Friis, Serocell, Phantom Fauna

 
 
WITW-01.jpg
WITW.jpg

Walk in the Woods is a free mini zine that you can get in the mail!

Creator Flissy Saucier writes and draws about her experiences walking in the woods in this monthly+ publication.

You can donate to keep the project going and get additional benefits.

HBM143: Laughing Rats and Dawn Rituals

Image by Jeff Emtman. Photo of sage grouse by Bob Wick of the Bureau of Land Management. Orange sky elements are the spectrogram of the sound of the mouse courtship call heard in this episode.

Image by Jeff Emtman. Photo of sage grouse by Bob Wick of the Bureau of Land Management. Orange sky elements are the spectrogram of the sound of the mouse courtship call heard in this episode.

 

Animals sometimes make noises that would be impossible to place without context.  In this episode, three types of animal vocalizations—described by the people who recorded them. 

The monkey who lost their mother. Photo by Stephanie Foden.

Ashley Ahearn: Journalist and producer of Grouse, from Birdnote and Boise State Public Radio

Joel Balsam: Journalist and producer of the upcoming podcast Parallel Lives.  Joel co-created a photo essay for ESPN about the “pororoca”, an Amazonian wave chased each year by surfers. 

Kevin Coffey, Ph.D.: Co-creator of DeepSqueak and researcher at VA Puget Sound and the University of Washington.  Kevin co-authored the paper DeepSqueak: a deep learning-based system for detection and analysis of ultrasonic vocalizations in Nature’s Neuropsychopharmacology journal. 

Also heard: calls of the Indies Short Tailed Cricket (Anurogryllus celerinictus), which may be the perpetrator of the so-called “sonic attacks” recently reported in Cuba.  Sound sent in by HBM listener Isaul in Puerto Rico.  

Producer: Jeff Emtman
Music: The Black Spot

 
Chas Co - Logo.jpg

Sponsor: Chas Co

Chas Co takes care of cats and dogs in Brooklyn (especially in Prospect Lefferts Gardens, Bed Stuy and surrounding neighborhoods). 

Chas Co welcomes pets with special behavioral and medical needs, including those that other services have turned away.  They offer dog walking, cat visiting, and custom care arrangements too. 

Look, it’s Kane!

HBM142: The Vastness of the Universe

Image by Jeff Emtman with source material from the 2016 frequency allotment poster and Greg Zaal’s Dikhololo Night via HDRI Haven.

 

1,420,405,751* hertz is a very important frequency.  It’s the frequency that hydrogen radiates at, creating radio waves that can be detected far away.  And astronomers can learn a lot about the history and shape of the universe by observing this “hydrogen line” frequency with radio telescopes

Extraterrestrial research astronomers also take a lot of interest in the hydrogen line...and it’s for the same exact reason, though the context is different.  It’s thought that if an alien species is capable of communicating with us, wouldn’t they also have figured out the importance of the hydrogen line?  And is it possible that just maybe, they’d use it (or frequencies near it) to communicate with us?  The theory being that the hydrogen line could be used as a kind of universal hailing channel for intelligent species—a representation of a shared understanding of physics. 

Talk of the hydrogen line was front and center in 1977, when an American astronomer named Jerry R Ehman found a very strong signal on the printout from a radio telescope dubbed “The Big Ear” at the Ohio State University.  The signal he found was close to the hydrogen line.  He noted the abnormality of the strong the signal by writing “Wow!” in red ink on the margins of the printout.  The so-called “Wow! Signal” has long been cited as potential evidence for alien communication. 

But Dr. Seth Shostak (senior astronomer at The SETI Institute and co-host of Big Picture Science) isn’t convinced.  His organization searches for extra terrestrial intelligence across the universe with a high degree of skepticism.  And he’s experienced a false positive or two over the years.  Seth thinks the Wow! Signal (and other related anomalous signals) are almost always tied back to human interference

 

An excerpt from the 2016 frequency allotment poster created by the USA’s Department of Congress. Near the middle, we’ve circled the protected hydrogen line frequencies.

 

In 1979 (not long after the Wow! Signal), frequencies near the hydrogen line became protected when a group called the International Telecommunication Union (ITU) created a 1000+ page document that included a worldwide recommendation to keep these channels clear for astronomy and SETI purposes, citing the “special importance to mankind to determine the existence of extraterrestrial civilizations.” (see page 920 of the Finals Acts of the World Administrative Radio Conference, Geneva, 1979)  

Despite this protection, Seth Shostak says there’s still interference on the hydrogen line from human sources.  That interference draws the ire of radio astronomers everywhere. Seth says, “It’s like turning on a bright light in a movie theater—you don’t ingratiate yourself with the patrons.”

Producer: Jeff Emtman
Music: The Black Spot

*Give or take some fractional hertz.

 
 
 
Patreon Social.png

Here Be Monsters’ supporters on Patreon send a small monthly (or yearly) donation to help cover Jeff’s living expenses, help pay contractors, fees, taxes, etc.

Listener Andrew Conkling says he signed up for the Patreon because HBM is one of his favorite podcasts: “I wanted to be part of the journey in seeing it continue.”

Thank you so much, HBM Patrons.

👽👉Become a patron👈👽

 

HBM141: Filthy Riches

Image by Jeff Emtman.

 

When a group of broke college students start throwing lavish feasts, HBM host Jeff Emtman begins to wonder at the source of the food, initially assuming it was stolen.  But he’s soon corrected.  Confronted with the shocking amount of food waste in the local dumpsters, he quickly turns into a freegan dumpster diving evangelist, but is often thwarted by an angry employee of a local produce stand.  An employee whose face is always hidden by a bright headlamp. 

Content Note: Language and descriptions of violence.

These encounters rattle him, making it hard for him to separate reality from his recurring night terrors about the incidents.  But, years later, and more than a hundred of miles away, he has an encounter in a chocolate dumpster which cures him of those nightmares. 

Many thanks to Jesse Chappelle and Hallie Sloan, who helped in the research of this episode. 

 
 
Coffee Beer Logo.jpg

Sponsor


Coffee Beer of Portland Oregon.

Coffee Beer gets you to and from the best parts of your day.  Located at 4142 SE 42nd Ave, Portland, OR 97206, they serve coffee, beer, snacks and groceries for pick-up and delivery.  Order yours at coffeebeerdelivers.com

Coffee Beer’s merch can be shipped!  Shirts, mugs and more, at coffeebeer.me

Thank you Coffee Beer for sponsoring HBM!

Jeff likes Coffee Beer’s “Leave Me Alone” shirt