Close Menu
  • Latest News
    • Market
    • Altcoins
    • Legal and Regulatory
  • Tech
    • Blockchain
    • Security and Privacy
  • Web 3
    • Web3 News
    • NFTs
    • Gaming
  • Learn
    • Education
    • Investments
    • Staking
    • Wallets and Exchanges
  • ICOs
  • Mining
  • Crypto Tools
    • Exchange Tool
  • Shop
What's Hot

DECISIONS OF THE ANNUAL GENERAL MEETING OF DIGITALIST GROUP PLC ON 28 APRIL 2026 AND OF THE BOARD OF DIRECTORS’ ORGANISATIONAL MEETING

April 28, 2026

Binance Ethereum Supply Hits 2020 Levels While Staking Locks A Third: Repricing Ahead?

April 28, 2026

Bitcoin miner Core Scientific shifts to AI with 1.5GW data center push

April 28, 2026
Facebook X (Twitter) Instagram
  • Contact
  • Privacy Policy
  • Terms & Conditions
Facebook X (Twitter) Instagram
CryptoPulseDaily.com
  • Latest News
    • Market
    • Altcoins
    • Legal and Regulatory
  • Tech
    • Blockchain
    • Security and Privacy
  • Web 3
    • Web3 News
    • NFTs
    • Gaming
  • Learn
    • Education
    • Investments
    • Staking
    • Wallets and Exchanges
  • ICOs
  • Mining
  • Crypto Tools
    • Exchange Tool
  • Shop
CryptoPulseDaily.com
Home»NFTs»New Study Calls Out ChatGPT-4 For Declining Performance
NFTs

New Study Calls Out ChatGPT-4 For Declining Performance

July 24, 2023No Comments4 Mins Read
Share
Facebook Twitter LinkedIn Pinterest Email

Recent observations from users and now researchers suggest that ChatGPT, the renowned artificial intelligence (AI) model developed by OpenAI, may be exhibiting signs of performance degradation. However, the reasons behind these perceived changes remain a topic of debate and speculation.

Last week, a study emerged from a collaboration between Stanford University and UC Berkeley which was published in the ArXiv preprint archive and highlighted noticeable differences in the responses of GPT-4 and its predecessor, GPT-3.5, over a span of a few months since the former’s March 13 debut.

A decline in accurate responses

One of the most striking findings was GPT-4’s reduced accuracy in answering complex mathematical questions. For instance, while the model demonstrated a high success rate (97.6 percent) in answering queries about large-scale prime numbers in March, its accuracy in answering that same prompt correctly plummeted to a mere 2.4 percent in June.

The study also pointed out that, while older versions of the bot offered detailed explanations for their answers, the latest iterations seemed more reticent, often forgoing step-by-step solutions even when explicitly prompted. Interestingly, during the same period, GPT-3.5 showed improved capabilities in addressing basic math problems, though it still struggled with more intricate code generation tasks.

Glad that someone did a scientific study showing what we’ve all observed:

ChatGPT (GPT4) has become worse over time.

I still use it regularly and pay the $20/month but hope it gets better soon. pic.twitter.com/IwQl4zP8R1

— Peter Yang (@petergyang) July 19, 2023

These findings have fueled online discussions on the topic, particularly among regular ChatGPT users how have long wondered about the possibility of the program being “neutered.” Many have taken to platforms like Reddit to share their experiences, with some speculating whether GPT-4’s performance is genuinely deteriorating or if users are becoming more discerning of the system’s inherent limitations. Some users recounted instances where the AI failed to restructure text as requested, opting instead for fictional narratives. Others highlighted the model’s struggles with basic problem-solving tasks, spanning both mathematics and coding.

See also  SEC charges Titan Global Capital Management for ‘misleading’ performance metrics

Coding ability changes, speculation, and more

The research team also delved into GPT-4’s coding capabilities, which appeared to have regressed. When the model was tested using problems from the online learning platform LeetCode, only 10 percent of the generated code adhered to the platform’s guidelines. This marked a significant drop from a 50 percent success rate observed in March.

OpenAI’s approach to updating and fine-tuning its models has always been somewhat enigmatic, leaving users and researchers to speculate about the changes made behind the scenes. With global concerns and ongoing legislation in the works surrounding AI regulation and its ethical use, transparency is increasingly on the minds of government regulators and even everyday users of the AI-based tech products that are emerging ever-more frequently.

While the model’s responses seemed to lack the depth and rationale observed in earlier versions, the recent study did note some positive developments: GPT-4 demonstrated enhanced resistance to certain types of attacks and showed a reduced propensity to respond to harmful prompts.

Peter Welinder, OpenAI’s VP of Product, addressed the concerns of the public more than a week before the study was released, stating that GPT-4 has not been “dumbed down.” He suggested that as more users engage with ChatGPT, they might become more attuned to its limitations.

No, we haven’t made GPT-4 dumber. Quite the opposite: we make each new version smarter than the previous one.

Current hypothesis: When you use it more heavily, you start noticing issues you didn’t see before.

— Peter Welinder (@npew) July 13, 2023

While the study offers valuable insights, it also raises more questions than it answers. The dynamic nature of AI models, combined with the proprietary nature of their development, means that users and researchers must often navigate a landscape of uncertainty. As AI continues to shape the future of technology and communication, the call for transparency and accountability is likely to only grow louder.

See also  An NFT Exhibition Seeking to Redefine Black Visibility Through AI



Source link

calls ChatGPT4 declining Performance study
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email

Related Posts

ZetaChain GPT-5.5 Integration Transforms Decentralized AI with Unmatched Privacy and Performance

April 27, 2026

Study Shows Implicity’s New Agnostic Cloud-Based AI Algorithm Further Reduces False Alerts Even After Manufacturer AI Filtering in Modern Devices

April 25, 2026

95% Of AI Projects Fail to Deliver Business Impact, MIT-Affiliated Study Finds — German Startup Bucks the Trend, Appoints Georgios Pipelidis to Lead U.S. Expansion

April 25, 2026

75% of crypto tax forms are under $50 – Kraken calls for ‘de minimis’ rule

April 23, 2026
Add A Comment
Leave A Reply Cancel Reply

Top Posts

Fallen FTX sues Ryan Salame for $98.8 million over alleged fraud

November 11, 2024

Bitcoin Miner Riot Platforms Is Getting Closer to Taking Over Bitfarms by Force

August 20, 2024

Operation First Light Seizes $257m in Global Scam Bust

June 27, 2024

Subscribe to Updates

Get the latest creative news From Crypto Daily Pulse directly in your Inbox!

Our mission is to develop a community of people who try to make financially sound decisions. The website strives to educate individuals in making wise choices about Crypto, ICOs, Web3, Blockchain and more.

We're social. Connect with us:

Facebook X (Twitter) Instagram Pinterest YouTube
Top Insights

DECISIONS OF THE ANNUAL GENERAL MEETING OF DIGITALIST GROUP PLC ON 28 APRIL 2026 AND OF THE BOARD OF DIRECTORS’ ORGANISATIONAL MEETING

April 28, 2026

Binance Ethereum Supply Hits 2020 Levels While Staking Locks A Third: Repricing Ahead?

April 28, 2026

Bitcoin miner Core Scientific shifts to AI with 1.5GW data center push

April 28, 2026
Get Informed

Subscribe to Updates

Get the latest creative news From Crypto Daily Pulse directly in your Inbox!

  • Contact
  • Privacy Policy
  • Terms & Conditions
© 2026 Crypto Pulse Daily - All rights reserved.

Type above and press Enter to search. Press Esc to cancel.

Cleantalk Pixel
  • bitcoinBitcoin(BTC)$76,022.00-0.85%
  • ethereumEthereum(ETH)$2,277.960.14%
  • tetherTether(USDT)$1.00-0.02%
  • rippleXRP(XRP)$1.37-1.01%
  • binancecoinBNB(BNB)$622.750.32%
  • usd-coinUSDC(USDC)$1.000.00%
  • solanaSolana(SOL)$83.49-0.75%
  • tronTRON(TRX)$0.323045-0.59%
  • Figure HelocFigure Heloc(FIGR_HELOC)$1.042.30%
  • dogecoinDogecoin(DOGE)$0.0992792.09%