{"id":35608,"date":"2022-04-18T16:26:23","date_gmt":"2022-04-18T20:26:23","guid":{"rendered":"https:\/\/mjtsai.com\/blog\/?p=35608"},"modified":"2022-10-07T16:27:11","modified_gmt":"2022-10-07T20:27:11","slug":"dall-e","status":"publish","type":"post","link":"https:\/\/mjtsai.com\/blog\/2022\/04\/18\/dall-e\/","title":{"rendered":"DALL-E"},"content":{"rendered":"<p><a href=\"https:\/\/stratechery.com\/2022\/dall-e-the-metaverse-and-zero-marginal-content\/\">Ben Thompson<\/a> (<a href=\"https:\/\/news.ycombinator.com\/item?id=31012806\">Hacker News<\/a>):<\/p>\n<blockquote cite=\"https:\/\/stratechery.com\/2022\/dall-e-the-metaverse-and-zero-marginal-content\/\">\n<p>Last week <a href=\"https:\/\/www.theverge.com\/2022\/4\/6\/23012123\/openai-clip-dalle-2-ai-text-to-image-generator-testing\">OpenAI released DALL-E 2<\/a>, which produces (or edits) images based on textual prompts; <a href=\"https:\/\/twitter.com\/BecomingCritter\/status\/1511808277490896903\">this Twitter thread from @BecomingCritter<\/a> has a whole host of example output[&#8230;]<\/p>\n<p>[&#8230;]<\/p>\n<p>[C]reating games, particularly their art, is expensive, and the expense increases the more immersive the experience is. Social media, on the other hand, is cheap because it uses user-generated content, but that content is generally stuck on more basic mediums &mdash; text, pictures, and only recently video. Of course that content doesn&rsquo;t necessarily need to be limited to your network &mdash; an algorithm can deliver anything on the network to any user.<\/p>\n<p>What is fascinating about DALL-E is that it points to a future where these three trends can be combined. DALL-E, at the end of the day, is ultimately a product of human-generated content, just like its GPT-3 cousin. The latter, of course, is about text, while DALL-E is about images. Notice, though, that progression from text to images; it follows that machine learning-generated video is next. This will likely take several years, of course; video is a much more difficult problem, and responsive 3D environments more difficult yet, but this is a path the industry has trod before[&#8230;]<\/p>\n<p>[&#8230;]<\/p>\n<p>Machine learning generated content is just the next step beyond TikTok: instead of pulling content from anywhere on the network, GPT and DALL-E and other similar models generate new content from content, at zero marginal cost. This is how the economics of the metaverse will ultimately make sense: virtual worlds needs virtual content created at virtually zero cost, fully customizable to the individual.<\/p>\n<\/blockquote>\n\n<p><a href=\"https:\/\/www.bramadams.dev\/projects\/dalle-tricks\">Bram Adams<\/a> (via <a href=\"https:\/\/news.ycombinator.com\/item?id=31009129\">Hacker News<\/a>):<\/p>\n<blockquote cite=\"https:\/\/www.bramadams.dev\/projects\/dalle-tricks\">\n<p>DALL&middot;E is intuivitely understandable on an emotional level while simultaneously being quite unintuitive on a logical level.<\/p>\n<p>[&#8230;]<\/p>\n<p>Here are some things I&rsquo;ve picked up so far that I think can help push the dialogue around DALL&middot;E forward.<\/p>\n<\/blockquote>\n\n<p>Previously:<\/p>\n<ul>\n<li><a href=\"https:\/\/mjtsai.com\/blog\/2021\/11\/08\/facebook-but-not-meta-ends-face-recognition\/\">Facebook, But Not Meta, Ends Face Recognition<\/a><\/li>\n<li><a href=\"https:\/\/mjtsai.com\/blog\/2021\/06\/29\/github-copilot\/\">GitHub Copilot<\/a><\/li>\n<li><a href=\"https:\/\/mjtsai.com\/blog\/2021\/03\/04\/multimodal-neurons-in-artificial-neural-networks\/\">Multimodal Neurons in Artificial Neural Networks<\/a><\/li>\n<\/ul>\n<p id=\"dall-e-update-2022-04-21\">Update (2022-04-21): <a href=\"https:\/\/twitter.com\/flyosity\/status\/1516607732689907716\">Mike Rundle<\/a>:<\/p>\n<blockquote cite=\"https:\/\/twitter.com\/flyosity\/status\/1516607732689907716\">\n<p>When humanity has access to a machine that can render anything you can possibly think of, the creative side of art will be how to describe such an image in a way that connects to the soul of the AI and how it understands the world.<\/p>\n<\/blockquote>\n\n<p id=\"dall-e-update-2022-10-07\">Update (2022-10-07): <a href=\"https:\/\/twitter.com\/gingerbeardman\/status\/1575247767231930368\">Matt Sephton<\/a>:<\/p>\n<blockquote cite=\"https:\/\/twitter.com\/gingerbeardman\/status\/1575247767231930368\">\n<p>Each have their pros\/cons. In my usage DALL-E could render a Moai &#x1F5FF; exactly but had no idea any the style I was asking for, Midjourney  gave a rough Moai but had exact style, Stable Diffusion gets both. Personally, I prefer the output of Midjourney<\/p>\n<\/blockquote>","protected":false},"excerpt":{"rendered":"<p>Ben Thompson (Hacker News): Last week OpenAI released DALL-E 2, which produces (or edits) images based on textual prompts; this Twitter thread from @BecomingCritter has a whole host of example output[&#8230;] [&#8230;] [C]reating games, particularly their art, is expensive, and the expense increases the more immersive the experience is. Social media, on the other hand, [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"apple_news_api_created_at":"2022-04-18T20:26:25Z","apple_news_api_id":"55a3ed97-e72f-440d-9b57-46178c9dd2b0","apple_news_api_modified_at":"2022-10-07T20:27:13Z","apple_news_api_revision":"AAAAAAAAAAAAAAAAAAAABA==","apple_news_api_share_url":"https:\/\/apple.news\/AVaPtl-cvRA2bV0YXjJ3SsA","apple_news_coverimage":0,"apple_news_coverimage_caption":"","apple_news_is_hidden":false,"apple_news_is_paid":false,"apple_news_is_preview":false,"apple_news_is_sponsored":false,"apple_news_maturity_rating":"","apple_news_metadata":"\"\"","apple_news_pullquote":"","apple_news_pullquote_position":"","apple_news_slug":"","apple_news_sections":"\"\"","apple_news_suppress_video_url":false,"apple_news_use_image_component":false,"footnotes":""},"categories":[2],"tags":[1351,101,167,2285,418,619,96],"class_list":["post-35608","post","type-post","status-publish","format-standard","hentry","category-technology","tag-artificial-intelligence","tag-business","tag-copyright","tag-dall-e","tag-game","tag-graphics","tag-web"],"apple_news_notices":[],"_links":{"self":[{"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/posts\/35608","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/comments?post=35608"}],"version-history":[{"count":4,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/posts\/35608\/revisions"}],"predecessor-version":[{"id":37262,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/posts\/35608\/revisions\/37262"}],"wp:attachment":[{"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/media?parent=35608"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/categories?post=35608"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/mjtsai.com\/blog\/wp-json\/wp\/v2\/tags?post=35608"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}