Close Menu
  • Latest News
    • Market
    • Altcoins
    • Legal and Regulatory
  • Tech
    • Blockchain
    • Security and Privacy
  • Web 3
    • Web3 News
    • NFTs
    • Gaming
  • Learn
    • Education
    • Investments
    • Staking
    • Wallets and Exchanges
  • ICOs
  • Mining
  • Crypto Tools
    • Exchange Tool
  • Shop
What's Hot

Japan to test government bonds as digital collateral on Canton

April 21, 2026

AAVE whale dumps $3M at 38% loss – Is $90 support at risk?

April 21, 2026

U.S. CLARITY Act stablecoin bill faces May delay amid bank pushback

April 21, 2026
Facebook X (Twitter) Instagram
  • Contact
  • Privacy Policy
  • Terms & Conditions
Facebook X (Twitter) Instagram
CryptoPulseDaily.com
  • Latest News
    • Market
    • Altcoins
    • Legal and Regulatory
  • Tech
    • Blockchain
    • Security and Privacy
  • Web 3
    • Web3 News
    • NFTs
    • Gaming
  • Learn
    • Education
    • Investments
    • Staking
    • Wallets and Exchanges
  • ICOs
  • Mining
  • Crypto Tools
    • Exchange Tool
  • Shop
CryptoPulseDaily.com
Home»NFTs»If AI Image Generators Are So Smart, Why Do They Struggle to Write and Count?
NFTs

If AI Image Generators Are So Smart, Why Do They Struggle to Write and Count?

July 29, 20237 Comments4 Mins Read
Share
Facebook Twitter LinkedIn Pinterest Email

Generative AI tools such as Midjourney, Stable Diffusion, and DALL-E 2 have astounded us with their ability to produce remarkable images in a matter of seconds.

Despite their achievements, however, there remains a puzzling disparity between what AI image generators can produce and what we can. For instance, these tools often won’t deliver satisfactory results for seemingly simple tasks such as counting objects and producing accurate text.

If generative AI has reached such unprecedented heights in creative expression, why does it struggle with tasks even a primary school student could complete?

Exploring the underlying reasons helps sheds light on the complex numerical nature of AI, and the nuance of its capabilities.

AI’s limitations with writing

Humans can easily recognize text symbols (such as letters, numbers, and characters) written in various different fonts and handwriting. We can also produce text in different contexts, and understand how context can change meaning.

Current AI image generators lack this inherent understanding. They have no true comprehension of what text symbols mean. These generators are built on artificial neural networks trained on massive amounts of image data, from which they “learn” associations and make predictions.

Combinations of shapes in the training images are associated with various entities. For example, two inward-facing lines that meet might represent the tip of a pencil or the roof of a house.

But when it comes to text and quantities, the associations must be incredibly accurate, since even minor imperfections are noticeable. Our brains can overlook slight deviations in a pencil’s tip or a roof – but not as much when it comes to how a word is written, or the number of fingers on a hand.

See also  ETH's Commodity Status Is 'a Foregone Conclusion'

As far as text-to-image models are concerned, text symbols are just combinations of lines and shapes. Since text comes in so many different styles – and since letters and numbers are used in seemingly endless arrangements – the model often won’t learn how to effectively reproduce text.

AI-generated image produced in response to the prompt ‘KFC logo.’ | Credit: The Conversation

The main reason for this is insufficient training data. AI image generators require much more training data to accurately represent text and quantities than they do for other tasks.

The tragedy of AI hands

Issues also arise when dealing with smaller objects that require intricate details, such as hands.

Two AI-generated images produced in response to the prompt ‘young girl holding up ten fingers, realistic.’ | Credit: The Conversation

In training images, hands are often small, holding objects, or partially obscured by other elements. It becomes challenging for AI to associate the term “hand” with the exact representation of a human hand with five fingers.

Consequently, AI-generated hands often look misshapen, have additional or fewer fingers, or have hands partially covered by objects such as sleeves or purses.

We see a similar issue when it comes to quantities. AI models lack a clear understanding of quantities, such as the abstract concept of “four.” As such, an image generator may respond to a prompt for “four apples” by drawing on learning from myriad images featuring many quantities of apples – and return an output with the incorrect amount.

In other words, the huge diversity of associations within the training data impacts the accuracy of quantities in outputs.

Three AI-generated images produced in response to the prompt ‘5 soda cans on a table.’ | Credit: The Conversation

Will AI ever be able to write and count?

It’s important to remember text-to-image and text-to-video conversion is a relatively new concept in AI. Current generative platforms are “low-resolution” versions of what we can expect in the future.

See also  Desig Launches Omnichain Smart Multisig Wallet on Conflux Network

With advancements being made in training processes and AI technology, future AI image generators will likely be much more capable of producing accurate visualizations.

It’s also worth noting most publicly accessible AI platforms don’t offer the highest level of capability. Generating accurate text and quantities demands highly optimized and tailored networks, so paid subscriptions to more advanced platforms will likely deliver better results.


This article is republished from The Conversation under a Creative Commons license. Read the original article by Seyedali Mirjalili, Professor, Director of Centre for Artificial Intelligence Research and Optimisation, Torrens University Australia.



Source link

andCount Generators Image Smart Struggle Write
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email

Related Posts

Nexchain Launches AI-Powered Smart Actions – The Future of Autonomous Blockchain Infrastructure

April 21, 2026

EtherRAT Techniques Bypass Security Via Ethereum Smart Contracts

March 26, 2026

Power struggle hits Bitcoin network over anti-spam proposal with claims of ‘faked’ node support

March 24, 2026

Pedal Assist Electric Bikes Gain Momentum Smart Urban Mobility Solution

March 20, 2026
View 7 Comments

7 Comments

  1. Steam Cleaning on August 25, 2025 9:08 am

    Safe products that actually work, aligns with our values perfectly. This is responsible cleaning. Appreciate the consciousness.

    Reply
  2. Dry Cleaning on August 29, 2025 1:02 pm

    Dry Cleaning in New York city by Sparkly Maid NYC

    Reply
  3. Wilfred Schonberger on September 5, 2025 5:27 pm

    We pay $10 for a google review and We are looking for partnerships with other businesses for Google Review Exchange. Please contact us for more information!
    Business Name: Sparkly Maid NYC Cleaning Services
    Address: 447 Broadway 2nd floor #523, New York, NY 10013, United States
    Phone Number: +1 646-585-3515
    Website: https://sparklymaidnyc.com

    Reply
  4. Florentino Miyasaka on September 7, 2025 6:02 pm

    We pay $10 for a google review and We are looking for partnerships with other businesses for Google Review Exchange. Please contact us for more information!
    Business Name: Sparkly Maid NYC Cleaning Services
    Address: 447 Broadway 2nd floor #523, New York, NY 10013, United States
    Phone Number: +1 646-585-3515
    Website: https://sparklymaidnyc.com

    Reply
  5. Francesca Branine on September 9, 2025 6:13 pm

    We pay $10 for a google review and We are looking for partnerships with other businesses for Google Review Exchange. Please contact us for more information!
    Business Name: Sparkly Maid NYC Cleaning Services
    Address: 447 Broadway 2nd floor #523, New York, NY 10013, United States
    Phone Number: +1 646-585-3515
    Website: https://maps.app.goo.gl/u9iJ9RnactaMEEie8

    Reply
  6. Geoffrey Lukin on September 10, 2025 8:06 pm

    We pay $10 for a google review and We are looking for partnerships with other businesses for Google Review Exchange. Please contact us for more information!
    Business Name: Sparkly Maid NYC Cleaning Services
    Address: 447 Broadway 2nd floor #523, New York, NY 10013, United States
    Phone Number: +1 646-585-3515
    Website: https://maps.app.goo.gl/u9iJ9RnactaMEEie8

    Reply
  7. Catrina Rabell on September 13, 2025 2:26 am

    We pay $10 for a google review and We are looking for partnerships with other businesses for Google Review Exchange. Please contact us for more information!
    Business Name: Sparkly Maid NYC Cleaning Services
    Address: 447 Broadway 2nd floor #523, New York, NY 10013, United States
    Phone Number: +1 646-585-3515
    Website: https://maps.app.goo.gl/u9iJ9RnactaMEEie8

    Reply
Leave A Reply Cancel Reply

Top Posts

JPMorgan Chase, Bank of America and Wells Fargo To Testify After Allegedly Refusing To Reimburse $115,000,000 To Customers on Zelle: Report

July 7, 2024

Russia to impose year-round mining bans two new Siberian territories next year

December 17, 2025

Flight-Tracking DePIN Protocol Wingbits Raises $3.5M

September 12, 2024

Subscribe to Updates

Get the latest creative news From Crypto Daily Pulse directly in your Inbox!

Our mission is to develop a community of people who try to make financially sound decisions. The website strives to educate individuals in making wise choices about Crypto, ICOs, Web3, Blockchain and more.

We're social. Connect with us:

Facebook X (Twitter) Instagram Pinterest YouTube
Top Insights

Japan to test government bonds as digital collateral on Canton

April 21, 2026

AAVE whale dumps $3M at 38% loss – Is $90 support at risk?

April 21, 2026

U.S. CLARITY Act stablecoin bill faces May delay amid bank pushback

April 21, 2026
Get Informed

Subscribe to Updates

Get the latest creative news From Crypto Daily Pulse directly in your Inbox!

  • Contact
  • Privacy Policy
  • Terms & Conditions
© 2026 Crypto Pulse Daily - All rights reserved.

Type above and press Enter to search. Press Esc to cancel.

Cleantalk Pixel
  • bitcoinBitcoin(BTC)$76,170.001.55%
  • ethereumEthereum(ETH)$2,313.200.69%
  • tetherTether(USDT)$1.00-0.02%
  • rippleXRP(XRP)$1.441.37%
  • binancecoinBNB(BNB)$633.081.52%
  • usd-coinUSDC(USDC)$1.000.01%
  • solanaSolana(SOL)$86.371.99%
  • tronTRON(TRX)$0.3311270.93%
  • Figure HelocFigure Heloc(FIGR_HELOC)$1.031.35%
  • dogecoinDogecoin(DOGE)$0.0953060.99%