{"id":1261286,"date":"2023-07-25T10:17:31","date_gmt":"2023-07-25T04:47:31","guid":{"rendered":"https:\/\/trak.in\/stories\/?p=1261286"},"modified":"2023-07-25T10:18:06","modified_gmt":"2023-07-25T04:48:06","slug":"chatgpts-mathematical-accuracy-falls-to-shocking-2-response-quality-deteriorates","status":"publish","type":"post","link":"https:\/\/trak.in\/stories\/chatgpts-mathematical-accuracy-falls-to-shocking-2-response-quality-deteriorates\/","title":{"rendered":"ChatGPT&#8217;s Mathematical Accuracy Falls To Shocking 2%; Response Quality Deteriorates"},"content":{"rendered":"\n<p>In recent times, there has been a growing number of reports and discussions about a decline in the quality of responses from ChatGPT. To investigate this matter, a team of researchers from Stanford and UC Berkeley conducted a study to quantify the extent of this degradation. The study confirmed that the drop in ChatGPT&#8217;s quality was indeed real.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"1000\" height=\"465\" src=\"https:\/\/trak.in\/stories\/wp-content\/uploads\/2023\/07\/Untitled-design-10-3.png\" alt=\"ChatGPT's Mathematical Accuracy Falls To Shocking 2%; Response Quality Deteriorates\" class=\"wp-image-1261295\" srcset=\"https:\/\/trak.in\/stories\/wp-content\/uploads\/2023\/07\/Untitled-design-10-3.png 1000w, https:\/\/trak.in\/stories\/wp-content\/uploads\/2023\/07\/Untitled-design-10-3-300x140.png 300w, https:\/\/trak.in\/stories\/wp-content\/uploads\/2023\/07\/Untitled-design-10-3-768x357.png 768w\" sizes=\"auto, (max-width: 1000px) 100vw, 1000px\" \/><\/figure>\n\n\n\n<p>The research paper titled &#8220;How Is ChatGPT&#8217;s Behavior Changing Over Time?&#8221; was <a href=\"https:\/\/www.tomshardware.com\/news\/chatgpt-response-quality-decline\">authored<\/a> by three prominent academics: Matei Zaharia, Lingjiao Chen, and James Zou. Matei Zaharia, who is a Computer Science Professor at UC Berkeley, shared the findings on Twitter, revealing a startling fact that GPT-4&#8217;s success rate in solving certain problems fell drastically from 97.6% to 2.4% between March and June.<\/p>\n\n\n\n<p>GPT-4, which was recently released and acclaimed as OpenAI&#8217;s most advanced model, had been eagerly anticipated by developers for its potential to power innovative AI products. However, the study&#8217;s results showed disappointing performance, especially in handling straightforward queries.<\/p>\n\n\n\n<p>The research team designed tasks to evaluate the quality of responses from the large language models (LLMs) GPT-4 and GPT-3.5. These tasks covered areas such as solving math problems, answering sensitive questions, code generation, and visual reasoning. The chart provided an overview of the performance of both models across their March and June releases in 2023.<\/p>\n\n\n\n<figure class=\"wp-block-image\"><img decoding=\"async\" src=\"https:\/\/lh4.googleusercontent.com\/WlQtm35hD1iNoKsWZ4-sPIORoBwtVheN9EcryPotxVEcjupnfSb84yzzJysjnUP3himZafUILPHYs5_zMR6Cs6FcSt1jpyNnJM5MhKG7sAzcRtL3QQcafIonAbRGhpal2Jrf52NkmHTPcsCuse0yseg\" alt=\"\"\/><\/figure>\n\n\n\n<p>The data clearly illustrated that the same LLM service provided different answers over time, showing significant differences in performance within this short period. It remains uncertain how these LLMs are updated and whether changes to improve one aspect of their performance might negatively affect others. Notably, the latest version of GPT-4 performed worse compared to the March version in three testing categories, with only a slight margin of improvement in visual reasoning.<\/p>\n\n\n\n<figure class=\"wp-block-image\"><img decoding=\"async\" src=\"https:\/\/lh6.googleusercontent.com\/CMe4qxgN6_5AN2a0V149p0rkn3X_QMAJlRI9xOwLd1MBsv0LJU_SqszTTKaxhwtKxCoOQhMXIcx2q0PpHB9milPFtis6EIlZtTQeApWa2hZ6Ru7qd2qxP8sjMJS_snA2LFZye_0F5KO1y8CmxspNsBo\" alt=\"\"\/><\/figure>\n\n\n\n<p>While some may not be concerned about the variable quality in the &#8220;same versions&#8221; of these LLMs, it is crucial to acknowledge that both GPT-4 and GPT-3.5 have been widely adopted by individual users and businesses due to the popularity of ChatGPT. As such, information generated by these models can significantly impact people&#8217;s lives.<\/p>\n\n\n\n<p>The researchers intend to continue assessing GPT versions in a more extended study. They suggest that OpenAI should consider monitoring and publishing regular quality checks for its paying customers. If not, it may be necessary for business or governmental organizations to keep an eye on basic quality metrics for these LLMs to avoid potential commercial and research impacts.<\/p>\n\n\n\n<p>The AI and LLM technology domain has had its share of surprising issues, and with data privacy concerns and other public relations challenges, it currently seems like the &#8220;wild west&#8221; frontier of connected life and commerce.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>In recent times, there has been a growing number of reports and discussions about a decline in the quality of responses from ChatGPT. To investigate this matter, a team of researchers from Stanford and UC Berkeley conducted a study to quantify the extent of this degradation. The study confirmed that the drop in ChatGPT&#8217;s quality [&hellip;]<\/p>\n","protected":false},"author":25,"featured_media":1261295,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[22],"tags":[584],"class_list":["post-1261286","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-technology","tag-chatgpt"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v25.9 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>ChatGPT&#039;s Mathematical Accuracy Falls To Shocking 2%; Response Quality Deteriorates - Trak.in - Indian Business of Tech, Mobile &amp; Startups<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/trak.in\/stories\/chatgpts-mathematical-accuracy-falls-to-shocking-2-response-quality-deteriorates\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"ChatGPT&#039;s Mathematical Accuracy Falls To Shocking 2%; Response Quality Deteriorates - Trak.in - Indian Business of Tech, Mobile &amp; Startups\" \/>\n<meta property=\"og:description\" content=\"In recent times, there has been a growing number of reports and discussions about a decline in the quality of responses from ChatGPT. To investigate this matter, a team of researchers from Stanford and UC Berkeley conducted a study to quantify the extent of this degradation. The study confirmed that the drop in ChatGPT&#8217;s quality [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/trak.in\/stories\/chatgpts-mathematical-accuracy-falls-to-shocking-2-response-quality-deteriorates\/\" \/>\n<meta property=\"og:site_name\" content=\"Trak.in - Indian Business of Tech, Mobile &amp; Startups\" \/>\n<meta property=\"article:published_time\" content=\"2023-07-25T04:47:31+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2023-07-25T04:48:06+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/trak.in\/stories\/wp-content\/uploads\/2023\/07\/Untitled-design-10-3.png\" \/>\n\t<meta property=\"og:image:width\" content=\"1000\" \/>\n\t<meta property=\"og:image:height\" content=\"465\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Radhika Kajarekar\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Radhika Kajarekar\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"3 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/trak.in\/stories\/chatgpts-mathematical-accuracy-falls-to-shocking-2-response-quality-deteriorates\/\",\"url\":\"https:\/\/trak.in\/stories\/chatgpts-mathematical-accuracy-falls-to-shocking-2-response-quality-deteriorates\/\",\"name\":\"ChatGPT's Mathematical Accuracy Falls To Shocking 2%; Response Quality Deteriorates - Trak.in - Indian Business of Tech, Mobile &amp; Startups\",\"isPartOf\":{\"@id\":\"https:\/\/trak.in\/stories\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/trak.in\/stories\/chatgpts-mathematical-accuracy-falls-to-shocking-2-response-quality-deteriorates\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/trak.in\/stories\/chatgpts-mathematical-accuracy-falls-to-shocking-2-response-quality-deteriorates\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/trak.in\/stories\/wp-content\/uploads\/2023\/07\/Untitled-design-10-3.png\",\"datePublished\":\"2023-07-25T04:47:31+00:00\",\"dateModified\":\"2023-07-25T04:48:06+00:00\",\"author\":{\"@id\":\"https:\/\/trak.in\/stories\/#\/schema\/person\/3d6ff61ba47715670139663cc4767b1c\"},\"breadcrumb\":{\"@id\":\"https:\/\/trak.in\/stories\/chatgpts-mathematical-accuracy-falls-to-shocking-2-response-quality-deteriorates\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/trak.in\/stories\/chatgpts-mathematical-accuracy-falls-to-shocking-2-response-quality-deteriorates\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/trak.in\/stories\/chatgpts-mathematical-accuracy-falls-to-shocking-2-response-quality-deteriorates\/#primaryimage\",\"url\":\"https:\/\/trak.in\/stories\/wp-content\/uploads\/2023\/07\/Untitled-design-10-3.png\",\"contentUrl\":\"https:\/\/trak.in\/stories\/wp-content\/uploads\/2023\/07\/Untitled-design-10-3.png\",\"width\":1000,\"height\":465},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/trak.in\/stories\/chatgpts-mathematical-accuracy-falls-to-shocking-2-response-quality-deteriorates\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/trak.in\/stories\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"ChatGPT&#8217;s Mathematical Accuracy Falls To Shocking 2%; Response Quality Deteriorates\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/trak.in\/stories\/#website\",\"url\":\"https:\/\/trak.in\/stories\/\",\"name\":\"Trak.in - Indian Business of Tech, Mobile &amp; Startups\",\"description\":\"Trak.in is a popular Indian Business, Technology, Mobile &amp; Startup blog featuring trending News, views and analytical take on Technology, Business, Finance, Telecom, Mobile, startups &amp; Social Media Space\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/trak.in\/stories\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/trak.in\/stories\/#\/schema\/person\/3d6ff61ba47715670139663cc4767b1c\",\"name\":\"Radhika Kajarekar\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/trak.in\/stories\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/f1db673eec2bc1089f0cb5aabbc342d28aa6e57f0b0ade5099ce322f0a984358?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/f1db673eec2bc1089f0cb5aabbc342d28aa6e57f0b0ade5099ce322f0a984358?s=96&d=mm&r=g\",\"caption\":\"Radhika Kajarekar\"},\"url\":\"https:\/\/trak.in\/stories\/author\/radhika-kajarekar\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"ChatGPT's Mathematical Accuracy Falls To Shocking 2%; Response Quality Deteriorates - Trak.in - Indian Business of Tech, Mobile &amp; Startups","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/trak.in\/stories\/chatgpts-mathematical-accuracy-falls-to-shocking-2-response-quality-deteriorates\/","og_locale":"en_US","og_type":"article","og_title":"ChatGPT's Mathematical Accuracy Falls To Shocking 2%; Response Quality Deteriorates - Trak.in - Indian Business of Tech, Mobile &amp; Startups","og_description":"In recent times, there has been a growing number of reports and discussions about a decline in the quality of responses from ChatGPT. To investigate this matter, a team of researchers from Stanford and UC Berkeley conducted a study to quantify the extent of this degradation. The study confirmed that the drop in ChatGPT&#8217;s quality [&hellip;]","og_url":"https:\/\/trak.in\/stories\/chatgpts-mathematical-accuracy-falls-to-shocking-2-response-quality-deteriorates\/","og_site_name":"Trak.in - Indian Business of Tech, Mobile &amp; Startups","article_published_time":"2023-07-25T04:47:31+00:00","article_modified_time":"2023-07-25T04:48:06+00:00","og_image":[{"width":1000,"height":465,"url":"https:\/\/trak.in\/stories\/wp-content\/uploads\/2023\/07\/Untitled-design-10-3.png","type":"image\/png"}],"author":"Radhika Kajarekar","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Radhika Kajarekar","Est. reading time":"3 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/trak.in\/stories\/chatgpts-mathematical-accuracy-falls-to-shocking-2-response-quality-deteriorates\/","url":"https:\/\/trak.in\/stories\/chatgpts-mathematical-accuracy-falls-to-shocking-2-response-quality-deteriorates\/","name":"ChatGPT's Mathematical Accuracy Falls To Shocking 2%; Response Quality Deteriorates - Trak.in - Indian Business of Tech, Mobile &amp; Startups","isPartOf":{"@id":"https:\/\/trak.in\/stories\/#website"},"primaryImageOfPage":{"@id":"https:\/\/trak.in\/stories\/chatgpts-mathematical-accuracy-falls-to-shocking-2-response-quality-deteriorates\/#primaryimage"},"image":{"@id":"https:\/\/trak.in\/stories\/chatgpts-mathematical-accuracy-falls-to-shocking-2-response-quality-deteriorates\/#primaryimage"},"thumbnailUrl":"https:\/\/trak.in\/stories\/wp-content\/uploads\/2023\/07\/Untitled-design-10-3.png","datePublished":"2023-07-25T04:47:31+00:00","dateModified":"2023-07-25T04:48:06+00:00","author":{"@id":"https:\/\/trak.in\/stories\/#\/schema\/person\/3d6ff61ba47715670139663cc4767b1c"},"breadcrumb":{"@id":"https:\/\/trak.in\/stories\/chatgpts-mathematical-accuracy-falls-to-shocking-2-response-quality-deteriorates\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/trak.in\/stories\/chatgpts-mathematical-accuracy-falls-to-shocking-2-response-quality-deteriorates\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/trak.in\/stories\/chatgpts-mathematical-accuracy-falls-to-shocking-2-response-quality-deteriorates\/#primaryimage","url":"https:\/\/trak.in\/stories\/wp-content\/uploads\/2023\/07\/Untitled-design-10-3.png","contentUrl":"https:\/\/trak.in\/stories\/wp-content\/uploads\/2023\/07\/Untitled-design-10-3.png","width":1000,"height":465},{"@type":"BreadcrumbList","@id":"https:\/\/trak.in\/stories\/chatgpts-mathematical-accuracy-falls-to-shocking-2-response-quality-deteriorates\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/trak.in\/stories\/"},{"@type":"ListItem","position":2,"name":"ChatGPT&#8217;s Mathematical Accuracy Falls To Shocking 2%; Response Quality Deteriorates"}]},{"@type":"WebSite","@id":"https:\/\/trak.in\/stories\/#website","url":"https:\/\/trak.in\/stories\/","name":"Trak.in - Indian Business of Tech, Mobile &amp; Startups","description":"Trak.in is a popular Indian Business, Technology, Mobile &amp; Startup blog featuring trending News, views and analytical take on Technology, Business, Finance, Telecom, Mobile, startups &amp; Social Media Space","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/trak.in\/stories\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/trak.in\/stories\/#\/schema\/person\/3d6ff61ba47715670139663cc4767b1c","name":"Radhika Kajarekar","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/trak.in\/stories\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/f1db673eec2bc1089f0cb5aabbc342d28aa6e57f0b0ade5099ce322f0a984358?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/f1db673eec2bc1089f0cb5aabbc342d28aa6e57f0b0ade5099ce322f0a984358?s=96&d=mm&r=g","caption":"Radhika Kajarekar"},"url":"https:\/\/trak.in\/stories\/author\/radhika-kajarekar\/"}]}},"jetpack_featured_media_url":"https:\/\/trak.in\/stories\/wp-content\/uploads\/2023\/07\/Untitled-design-10-3.png","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/trak.in\/stories\/wp-json\/wp\/v2\/posts\/1261286","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/trak.in\/stories\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/trak.in\/stories\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/trak.in\/stories\/wp-json\/wp\/v2\/users\/25"}],"replies":[{"embeddable":true,"href":"https:\/\/trak.in\/stories\/wp-json\/wp\/v2\/comments?post=1261286"}],"version-history":[{"count":1,"href":"https:\/\/trak.in\/stories\/wp-json\/wp\/v2\/posts\/1261286\/revisions"}],"predecessor-version":[{"id":1261296,"href":"https:\/\/trak.in\/stories\/wp-json\/wp\/v2\/posts\/1261286\/revisions\/1261296"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/trak.in\/stories\/wp-json\/wp\/v2\/media\/1261295"}],"wp:attachment":[{"href":"https:\/\/trak.in\/stories\/wp-json\/wp\/v2\/media?parent=1261286"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/trak.in\/stories\/wp-json\/wp\/v2\/categories?post=1261286"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/trak.in\/stories\/wp-json\/wp\/v2\/tags?post=1261286"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}