{"id":5463,"date":"2025-01-11T00:26:00","date_gmt":"2025-01-11T00:26:00","guid":{"rendered":"https:\/\/dalelane.co.uk\/blog\/?p=5463"},"modified":"2026-03-14T21:29:42","modified_gmt":"2026-03-14T21:29:42","slug":"shoebox-an-artificial-intelligence-history-project","status":"publish","type":"post","link":"https:\/\/dalelane.co.uk\/blog\/?p=5463","title":{"rendered":"&#8220;Shoebox&#8221;: an artificial intelligence history project"},"content":{"rendered":"<h3>What was IBM Shoebox?<\/h3>\n<p>IBM Shoebox was the world&#8217;s first speech-recognition system, created in 1961. It was a voice controlled calculator: you input a sum by speaking the numbers zero through nine and six command words, including &#8220;plus&#8221;, &#8220;minus&#8221;, and &#8220;total&#8221;.<\/p>\n<p>To calculate <code style=\"font-weight: bold;\">12 + 34<\/code> you could say &#8220;one two plus three four total&#8221; and it would respond with the answer.<\/p>\n<p>You can see it being used by inventor William Dersch in <a href=\"https:\/\/ibm.biz\/shoebox-demo\">this two-minute demo video<\/a>.<\/p>\n<p><iframe loading=\"lazy\" style=\"border: thin black solid;\" width=\"450\" height=\"270\" src=\"https:\/\/www.youtube.com\/embed\/rQco1sa9AwU?si=x7GDFNSfN6PGDVX1\" title=\"YouTube video player\" frameborder=\"0\" allow=\"accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share\" referrerpolicy=\"strict-origin-when-cross-origin\" allowfullscreen><\/iframe><br \/>\n<small><a href=\"https:\/\/youtu.be\/rQco1sa9AwU\">youtu.be\/rQco1sa9AwU<\/a><\/small><\/p>\n<p><!--more-->There is a load of information about Shoebox on <a href=\"https:\/\/www.ibm.com\/history\/voice-recognition\">ibm.com\/history<\/a> and it is worth a read.<\/p>\n<p><img decoding=\"async\" style=\"border: thin black solid; width: 100%; max-width: 450px;\" src=\"https:\/\/images.dalelane.co.uk\/2025-01-10-shoebox\/ibmcom.png?raw=true\"\/><\/p>\n<p>Some of it is just photos and fun trivia. For example, it was named after it&#8217;s size &#8211; as it was the size of a shoebox. And it was a successor of an earlier (larger) prototype in the 1950&#8217;s called &#8220;Suitcase&#8221;.<\/p>\n<p>But there is also technical information &#8211; including hardware specs (Shoebox contained 31 transistors), but perhaps more interestingly there is an explanation of the approach that the system took to recognize words.<\/p>\n<p>This is also covered in a ten-minute film available on the site. This one was also recorded in the 1960&#8217;s, but this one is in colour!<\/p>\n<p><img decoding=\"async\" style=\"border: thin black solid; width: 100%; max-width: 450px;\" src=\"https:\/\/images.dalelane.co.uk\/2025-01-10-shoebox\/video.png?raw=true\"\/><\/p>\n<p>I find it fascinating to compare the coverage that Shoebox received at the time with the way that artificial intelligence is reported today. <a href=\"https:\/\/time.com\/archive\/6625053\/science-shoebox-is-listening\/\">TIME Magazine&#8217;s article from November 1961<\/a> is a good example:<\/p>\n<blockquote><p>Shoebox is not distracted by ordinary room noises\u2014even loud ones\u2014but Dersch talks into its microphone gently and takes pains to pronounce his words completely. Shoebox listens and dutifully prints numbers and symbols on a roll of paper. &#8230; It is not disturbed if \u2018six is pronounced \u201cseex,\u201d but it insists on being obtuse if \u201cfive\u201d is pronounced \u201cfi\u2019,\u201d as is common in rapid speech.<\/p>\n<p>[Shoebox&#8217;s engineers will] &#8230; try to make Shoebox recognize mumbled, slurred, and female voices; at present it can handle only the words of clear-spoken males. Most foreign languages are no problem for Shoebox, but it is baffled by Chinese, Bantu and other tongues that depend on tone for their meaning..<\/p>\n<p>When Shoebox grows up, IBM may set it to work taking down spoken words and numbers for such harried people as airplane pilots or supermarket checkers. Later, it may graduate to recording customers\u2019 orders, controlling machine tools, or solving mathematical problems. Eventually, the day may come when a troubled scientist or businessman can tell his problem by voice to the listening ear of an electronic computer\u2014and get a spoken oracle answer soon after he stops talking.<\/p><\/blockquote>\n<p>We&#8217;ve been <a href=\"https:\/\/www.raspberrypi.org\/blog\/why-were-taking-a-problem-first-approach-to-the-development-of-ai-systems\">anthropomorphizing AI<\/a> for over sixty years.<\/p>\n<h3>Using Shoebox as an educational project<\/h3>\n<p>I think recreating Shoebox today is a great project for children.<\/p>\n<p>It has <strong>a simple goal that they can easily understand<\/strong> &#8211; adding and subtracting numbers is something they have done since they an early age.<\/p>\n<p>Implementing a calculator is <strong>simple to code<\/strong> in low-code environments such as <a href=\"https:\/\/scratch.mit.edu\/\">Scratch<\/a>.<\/p>\n<p>A machine learning model able to recognize fifteen words is <strong>small enough to run on (almost?) any computer<\/strong> so it&#8217;s accessible even to students with low-powered devices.<\/p>\n<p>A model can be trained to recognize numbers and a few commands with a small number of examples, so students can <strong>create this machine learning model for themselves<\/strong> rather than just use an existing model.<\/p>\n<p>It was such a well-documented project that there is a wealth of pictures, videos, documents and news articles about it. Including the history in the project is a great <strong>reminder that artificial intelligence is a field with a long history<\/strong>, and that we&#8217;re all building on the decades of achievements of engineers that came before us.<\/p>\n<h3>Try it for yourself!<\/h3>\n<p>One example of how this could be done is using <a href=\"https:\/\/machinelearningforkids.co.uk\/\">Machine Learning for Kids<\/a>. You can record examples of yourself saying numbers and words like &#8220;plus&#8221; and &#8220;minus&#8221; using your computer&#8217;s microphone.<\/p>\n<p><em>With younger students, it&#8217;d be best not to ask them to record <strong>all<\/strong> numbers as this is likely a little time-consuming. Just record a few instead. A calculator that can do sums with, for example, the digits 1, 2, and 3 still gets the point across just as well.<\/em><\/p>\n<p><img decoding=\"async\" style=\"border: thin black solid; width: 100%; max-width: 450px;\" src=\"https:\/\/images.dalelane.co.uk\/2025-01-10-shoebox\/record.png?raw=true\"\/><\/p>\n<p>The tool shows a spectrogram representation of their recordings. <\/p>\n<p>I always enjoy trying to recognize what I say in these visualisations &#8211; such as the two hard &#8220;T&#8221; sounds in &#8220;total&#8221; in this screenshot.<\/p>\n<p><img decoding=\"async\" style=\"border: thin black solid; width: 100%; max-width: 450px;\" src=\"https:\/\/images.dalelane.co.uk\/2025-01-10-shoebox\/training.png?raw=true\"\/><\/p>\n<p>These recordings can be used to train a custom machine learning model that can then be used in a Scratch project.<\/p>\n<p><img decoding=\"async\" style=\"border: thin black solid; width: 100%; max-width: 450px;\" src=\"https:\/\/images.dalelane.co.uk\/2025-01-10-shoebox\/scratch.png?raw=true\"\/><\/p>\n<p>This is what it looks like in action!<\/p>\n<p><iframe loading=\"lazy\" style=\"border: thin black solid;\" width=\"450\" height=\"280\" src=\"https:\/\/www.youtube.com\/embed\/uJA8LFBmDQA?si=6NN-ZndbuXf4cUlI\" title=\"YouTube video player\" frameborder=\"0\" allow=\"accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share\" referrerpolicy=\"strict-origin-when-cross-origin\" allowfullscreen><\/iframe><br \/>\n<small><a href=\"https:\/\/youtu.be\/uJA8LFBmDQA\">youtu.be\/uJA8LFBmDQA<\/a><\/small><\/p>\n<p>If you&#8217;d like to give it a go, I&#8217;ve written step-by-step instructions that you can <a href=\"https:\/\/machinelearningforkids.co.uk\/worksheets\">download as a free PDF<\/a>.<\/p>\n<p><img decoding=\"async\" style=\"border: thin black solid; width: 100%; max-width: 450px;\" src=\"https:\/\/images.dalelane.co.uk\/2025-01-10-shoebox\/worksheet.png?raw=true\"\/><\/p>\n<p>It&#8217;s Creative Commons-licensed and you can <a href=\"https:\/\/github.com\/IBM\/taxinomitis-docs\/blob\/master\/project-worksheets\/msword\/worksheet-shoebox.docx\">download my original Microsoft Word doc<\/a> used to make the PDF. I hope that someone will improve on it to make a more compelling activity from this. I am convinced that there are ways to bring some of this history to life that makes this more than just another AI project.<\/p>\n<h3>What do you think?<\/h3>\n<p>I had a lot of fun learning about Shoebox and putting this together. I&#8217;d love to hear what you think of it, especially if you give it a try for yourself.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>What was IBM Shoebox? IBM Shoebox was the world&#8217;s first speech-recognition system, created in 1961. It was a voice controlled calculator: you input a sum by speaking the numbers zero through nine and six command words, including &#8220;plus&#8221;, &#8220;minus&#8221;, and &#8220;total&#8221;. To calculate 12 + 34 you could say &#8220;one two plus three four total&#8221; [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":5464,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[7],"tags":[587],"class_list":["post-5463","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-code","tag-mlforkids-tech"],"_links":{"self":[{"href":"https:\/\/dalelane.co.uk\/blog\/index.php?rest_route=\/wp\/v2\/posts\/5463","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/dalelane.co.uk\/blog\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/dalelane.co.uk\/blog\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/dalelane.co.uk\/blog\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/dalelane.co.uk\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=5463"}],"version-history":[{"count":1,"href":"https:\/\/dalelane.co.uk\/blog\/index.php?rest_route=\/wp\/v2\/posts\/5463\/revisions"}],"predecessor-version":[{"id":5903,"href":"https:\/\/dalelane.co.uk\/blog\/index.php?rest_route=\/wp\/v2\/posts\/5463\/revisions\/5903"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/dalelane.co.uk\/blog\/index.php?rest_route=\/wp\/v2\/media\/5464"}],"wp:attachment":[{"href":"https:\/\/dalelane.co.uk\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=5463"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/dalelane.co.uk\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=5463"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/dalelane.co.uk\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=5463"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}