Blue Diamond Web Services

Your Best Hosting Service Provider!

November 21, 2024

OpenAI accidentally deleted potential evidence in NY Times copyright lawsuit

Lawyers for The New York Times and Daily News, which are suing OpenAI for allegedly scraping their works to train its AI models without permission, say OpenAI engineers accidentally deleted data potentially relevant to the case.

Earlier this fall, OpenAI agreed to provide two virtual machines so counsel for The Times and Daily News could perform searches for copyrighted content in its training data sets. (Virtual machines are software-based computers that exist within another computer’s operating system, often used for the purposes of testing, backing up data, and running apps.) In a letter, attorneys for the publishers say that they and experts have spent over 150 hours since November 1 searching OpenAI’s training data.

But on November 14, OpenAI engineers erased all the publishers’ search data stored on one of the virtual machines, according to the aforementioned letter, which was filed in the U.S. District Court for the Southern District of New York late Wednesday.

OpenAI tried to recover the data — and was mostly successful. However, because the folder structure and file names were “irretrievably” lost, the recovered data “cannot be used to determine where the news plaintiffs’ copied articles were used to build [OpenAI’s] models,” per the letter.

“News plaintiffs have been forced to recreate their work from scratch using significant person-hours and computer processing time,” counsel for The Times and Daily News wrote. “The news plaintiffs learned only yesterday that the recovered data is unusable and that an entire week’s worth of its experts’ and lawyers’ work must be re-done, which is why this supplemental letter is being filed today.”

The plaintiffs’ counsel makes clear that they have no reason to believe the deletion was intentional. But they do say the incident underscores that OpenAI “is in the best position to search its own datasets” for potentially infringing content using its own tools.

We’ve reached out to OpenAI for comment and will update this piece if we hear back.

In this case and others, OpenAI has maintained that training models using publicly available data — including articles from The Times and Daily News — is fair use. In other words, in creating models like GPT-4o, which “learn” from billions of examples of ebooks, essays, and more to generate human-sounding text, OpenAI believes that it isn’t required to license or otherwise pay for the examples — even if it makes money from those models.

That being said, OpenAI has inked licensing deals with a growing number of new publishers, including The Associated Press, Business Insider owner Axel Springer, Financial Times, People parent company Dotdash Meredith, and News Corp. OpenAI has declined to make the terms of these deals public, but one content partner, Dotdash, is reportedly being paid at least $16 million per year.

OpenAI has neither confirmed nor denied that it trained its models on any specific copyrighted works without permission.

Keep reading the article on TechCrunch


November 20, 2024

Current AI scaling laws are showing diminishing returns, forcing AI labs to change course

AI labs traveling the road to super-intelligent systems are realizing they might have to take a detour.

“AI scaling laws,” the methods and expectations that labs have used to increase the capabilities of their models for the last five years, are now showing signs of diminishing returns, according to several AI investors, founders, and CEOs who spoke with TechCrunch. Their sentiments echo recent reports that indicate models inside leading AI labs are improving more slowly than they used to.

Everyone now seems to be admitting you can’t just use more compute and more data while pretraining large language models, and expect them to turn into some sort of all-knowing digital god. Maybe that sounds obvious, but these scaling laws were a key factor in developing ChatGPT, making it better, and likely influencing many CEOs to make bold predictions about AGI arriving in just a few years.

OpenAI and Safe Superintelligence co-founder Ilya Sutskever told Reuters last week that “everyone is looking for the next thing” to scale their AI models. Earlier this month, a16z co-founder Marc Andreessen said in a podcast that AI models currently seem to be converging at the same ceiling on capabilities.

But now, almost immediately after these concerning trends started to emerge, AI CEOs, researchers, and investors are already declaring we’re in a new era of scaling laws. “Test-time compute,” which gives AI models more time and compute to “think” before answering a question, is an especially promising contender to be the next big thing.

“We are seeing the emergence of a new scaling law,” said Microsoft CEO Satya Nadella onstage at Microsoft Ignite on Tuesday, referring to the test-time compute research underpinning OpenAI’s o1 model.

He’s not the only one now pointing to o1 as the future.

“We’re now in the second era of scaling laws, which is test-time scaling,” said Andreessen Horowitz partner Anjney Midha, who also sits on the board of Mistral and was an angel investor in Anthropic, in a recent interview with TechCrunch.

If the unexpected success – and now, the sudden slowing – of the previous AI scaling laws tell us anything, it’s that it is very hard to predict how and when AI models will improve.

Regardless, there seems to be a paradigm shift underway: the ways AI labs try to advance their models for the next five years likely won’t resemble the last five.

What are AI scaling laws?

The rapid AI model improvements that OpenAI, Google, Meta, and Anthropic have achieved since 2020 can largely be attributed to one key insight: use more compute and more data during an AI model’s pretraining phase.

When researchers give machine learning systems abundant resources during this phase – in which AI identifies and stores patterns in large datasets – models have tended to perform better at predicting the next word or phrase.
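To make that relationship concrete, the most widely cited formulation is the “Chinchilla” scaling law from DeepMind’s Hoffmann et al. (2022) paper, which models pretraining loss as a power law in parameter count and training tokens. The sketch below plugs in the paper’s published constants purely for illustration: individual labs fit their own curves to their own training runs, and this is not any specific company’s internal formula.

# Rough sketch of a Chinchilla-style pretraining scaling law (Hoffmann et al., 2022).
# The constants are the paper's published fit and are illustrative only; each lab
# fits its own curve to its own training runs.

def chinchilla_loss(params: float, tokens: float) -> float:
    """Predicted pretraining loss for a model with `params` parameters
    trained on `tokens` tokens of text."""
    E = 1.69                  # irreducible loss of the data distribution
    A, alpha = 406.4, 0.34    # contribution from model size
    B, beta = 410.7, 0.28     # contribution from dataset size
    return E + A / params**alpha + B / tokens**beta

# Each 10x of parameters or data still lowers the predicted loss, but by less
# each time: the "diminishing returns" the investors above are describing.
for n, d in [(7e9, 1.4e12), (70e9, 1.4e12), (70e9, 14e12)]:
    print(f"{n:.0e} params, {d:.0e} tokens -> predicted loss {chinchilla_loss(n, d):.3f}")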

This first generation of AI scaling laws pushed the envelope of what computers could do, as engineers increased the number of GPUs used and the quantity of data they were fed. Even if this particular method has run its course, it has already redrawn the map. Every Big Tech company has basically gone all in on AI, while Nvidia, which supplies the GPUs all these companies train their models on, is now the most valuable publicly traded company in the world.

But these investments were also made with the expectation that scaling would continue as expected.

It’s important to note that scaling laws are not laws of nature, physics, math, or government. They’re not guaranteed by anything, or anyone, to continue at the same pace. Even Moore’s Law, another famous scaling law, eventually petered out — though it certainly had a longer run.

“If you just put in more compute, you put in more data, you make the model bigger – there are diminishing returns,” said Anyscale co-founder and former CEO Robert Nishihara in an interview with TechCrunch. “In order to keep the scaling laws going, in order to keep the rate of progress increasing, we also need new ideas.”

Nishihara is quite familiar with AI scaling laws. Anyscale reached a billion-dollar valuation by developing software that helps OpenAI and other AI model developers scale their AI training workloads to tens of thousands of GPUs. Anyscale has been one of the biggest beneficiaries of pretraining scaling laws around compute, but even its cofounder recognizes that the season is changing.

“When you’ve read a million reviews on Yelp, maybe the next reviews on Yelp don’t give you that much,” said Nishihara, referring to the limitations of scaling data. “But that’s pretraining. The methodology around post-training, I would say, is quite immature and has a lot of room left to improve.”

To be clear, AI model developers will likely continue chasing after larger compute clusters and bigger datasets for pretraining, and there’s probably more improvement to eke out of those methods. Elon Musk recently finished building a supercomputer with 100,000 GPUs, dubbed Colossus, to train xAI’s next models. There will be more, and larger, clusters to come.

But trends suggest exponential growth is not possible by simply using more GPUs with existing strategies, so new methods are suddenly getting more attention.

Test-time compute: the AI industry’s next big bet

When OpenAI released a preview of its o1 model, the startup announced it was part of a new series of models separate from GPT.

OpenAI improved its GPT models largely through traditional scaling laws: more data, more compute during pretraining. But now that method reportedly isn’t gaining them much. The o1 family of models instead relies on a new concept, test-time compute, so called because the computing resources are used after a prompt, not before. The technique hasn’t been explored much yet in the context of neural networks, but is already showing promise.

Some are already pointing to test-time compute as the next method to scale AI systems.

“A number of experiments are showing that even though pretraining scaling laws may be slowing, the test-time scaling laws – where you give the model more compute at inference – can give increasing gains in performance,” said a16z’s Midha.

“OpenAI’s new ‘o’ series pushes [chain-of-thought] further, and requires far more computing resources, and therefore energy, to do so,” said famed AI researcher Yoshua Bengio in an op-ed on Tuesday. “We thus see a new form of computational scaling appear. Not just more training data and larger models but more time spent ‘thinking’ about answers.”

Over a period of 10 to 30 seconds, OpenAI’s o1 model re-prompts itself several times, breaking down a large problem into a series of smaller ones. Despite ChatGPT saying it is “thinking,” it isn’t doing what humans do — although our internal problem-solving methods, which benefit from clear restatement of a problem and stepwise solutions, were key inspirations for the method.
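OpenAI hasn’t published how o1 works under the hood, so the snippet below is only a rough sketch of the general test-time compute idea: spend extra inference calls breaking a question into sub-problems and working through them before committing to an answer. The call_model function is a hypothetical stand-in for whatever LLM API you use, not an OpenAI interface.

# Hedged sketch of test-time compute: spend more inference calls per question
# instead of relying on a single forward pass. This is NOT OpenAI's actual o1
# pipeline (which is unpublished); call_model is a hypothetical stand-in.

def call_model(prompt: str) -> str:
    """Hypothetical LLM call; wire up a real API client here."""
    raise NotImplementedError("replace with your model provider")

def answer_with_thinking(question: str, n_steps: int = 4) -> str:
    # 1. Ask the model to break the question into smaller sub-problems.
    plan = call_model(f"List {n_steps} sub-problems to solve before answering:\n{question}")

    # 2. Re-prompt the model once per sub-problem, accumulating intermediate work.
    scratchpad = ""
    for step in plan.splitlines()[:n_steps]:
        scratchpad += call_model(
            f"Question: {question}\nWork so far:\n{scratchpad}\nNow solve: {step}"
        ) + "\n"

    # 3. Ask for a final answer conditioned on the accumulated reasoning.
    return call_model(f"Question: {question}\nReasoning:\n{scratchpad}\nFinal answer:")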

A decade or so back, Noam Brown, who now leads OpenAI’s work on o1, was trying to build AI systems that could beat humans at poker. During a recent talk, Brown says he noticed at the time how human poker players took time to consider different scenarios before playing a hand. In 2017, he introduced a method to let a model “think” for 30 seconds before playing. In that time, the AI was playing different subgames, figuring out how different scenarios would play out to determine the best move.

Ultimately, the AI performed seven times better than his past attempts.

Granted, Brown’s research in 2017 did not use neural networks, which weren’t as popular at the time. However, MIT researchers released a paper last week showing that test-time compute significantly improves an AI model’s performance on reasoning tasks.

It’s not immediately clear how test-time compute would scale. It could mean that AI systems need a really long time to think about hard questions; maybe hours or even days. Another approach could be letting an AI model “think” through a question on lots of chips simultaneously.

If test-time compute does take off as the next place to scale AI systems, Midha says the demand for AI chips that specialize in high-speed inference could go up dramatically. This could be good news for startups such as Groq and Cerebras, which specialize in fast AI inference chips. If finding the answer is just as compute-heavy as training the model, the “pick and shovel” providers in AI win again.

The AI world is not yet panicking

Most of the AI world doesn’t seem to be losing its cool about these old scaling laws slowing down. Even if test-time compute does not prove to be the next wave of scaling, some feel we’re only scratching the surface of applications for current AI models.

New popular products could buy AI model developers some time to figure out new ways to improve the underlying models.

“I’m completely convinced we’re going to see at least 10 to 20x gains in model performance just through pure application-level work, just allowing the models to shine through intelligent prompting, UX decisions, and passing context at the right time into the models,” said Midha.

For example, ChatGPT’s Advanced Voice Mode is one of the more impressive applications from current AI models. However, that was largely an innovation in user experience, not necessarily the underlying tech. You can see how further UX innovations, such as giving that feature access to the web or applications on your phone, would make the product that much better.

Kian Katanforoosh, the CEO of AI startup Workera and a Stanford adjunct lecturer on deep learning, tells TechCrunch that companies building AI applications, like his, don’t necessarily need exponentially smarter models to build better products. He also says the products around current models have a lot of room to get better.

“Let’s say you build AI applications and your AI hallucinates on a specific task,” said Katanforoosh. “There are two ways that you can avoid that. Either the LLM has to get better and it will stop hallucinating, or the tooling around it has to get better and you’ll have opportunities to fix the issue.”
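Katanforoosh doesn’t describe what that tooling looks like, but one common application-level pattern is to check a model’s output against the source material and retry when the check fails, rather than waiting for the underlying model to stop hallucinating. The sketch below is a generic illustration of that idea; call_model is again a hypothetical stand-in for an LLM API, and the quote-checking validator is just one possible check.

# Hedged sketch of "tooling around the model" for hallucinations: validate the
# output and retry instead of trusting a single generation. call_model is a
# hypothetical stand-in; the validator is application-specific.

def call_model(prompt: str) -> str:
    """Hypothetical LLM call; wire up a real API client here."""
    raise NotImplementedError("replace with your model provider")

def grounded_answer(question: str, source_text: str, max_retries: int = 2) -> str:
    prompt = f"Answer using ONLY this source:\n{source_text}\n\nQuestion: {question}"
    for _ in range(max_retries + 1):
        answer = call_model(prompt)
        # Application-level check: any quoted span must actually appear in the source.
        quoted_spans = answer.split('"')[1::2]
        if all(span in source_text for span in quoted_spans):
            return answer
        prompt += "\n\nYour previous answer quoted text that is not in the source. Try again."
    return "Could not produce a grounded answer; escalating to a human reviewer."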

Whatever the case is for the frontier of AI research, users probably won’t feel the effects of these shifts for some time. That said, AI labs will do whatever is necessary to continue shipping bigger, smarter, and faster models at the same rapid pace. That means several leading tech companies could now pivot how they’re pushing the boundaries of AI.

Keep reading the article on TechCrunch


OpenAI releases a teacher’s guide to ChatGPT, but some educators are skeptical

OpenAI envisions teachers using its AI-powered tools to create lesson plans and interactive tutorials for students. But some educators are wary of the technology — and its potential to go awry.

Today, OpenAI released a free online course designed to help K-12 teachers learn how to bring ChatGPT, the company’s AI chatbot platform, into their classrooms. Created in collaboration with the nonprofit organization Common Sense Media, with which OpenAI has an active partnership, the one-hour, nine-module program covers the basics of AI and its pedagogical applications.

OpenAI says that it’s already deployed the course in “dozens” of schools, including the Agua Fria School District in Arizona, the San Bernardino School District in California, and the charter school system Challenger Schools. Per the company’s internal research, 98% of participants said the program offered new ideas or strategies that they could apply to their work.

“Schools across the country are grappling with new opportunities and challenges as AI reshapes education,” Robbie Torney, senior director of AI programs at Common Sense Media, said in a statement. “With this course, we are taking a proactive approach to support and educate teachers on the front lines and prepare for this transformation.”

But some educators don’t see the program as helpful — and think it could in fact mislead.

Lance Warwick, a sports lecturer at the University of Illinois Urbana-Champaign, is concerned resources like OpenAI’s will normalize AI use among educators unaware of the tech’s ethical implications. While OpenAI’s course covers some of ChatGPT’s limitations, like that it can’t fairly grade students’ work, Warwick found the modules on privacy and safety to be “very limited” — and contradictory.

“In the example prompts [OpenAI gives], one tells you to incorporate grades and feedback from past assignments, while another tells you to create a prompt for an activity to teach the Mexican Revolution,” Warwick noted. “In the next module on safety, it tells you to never input student data, and then talks about the bias inherent in generative AI and the issues with accuracy. I’m not sure those are compatible with the use cases.”

Sin á Tres Souhaits, a visual artist and educator at The University of Arizona, says that he’s found AI tools to be helpful in writing assignment guides and other supplementary course materials. But he also says he’s concerned that OpenAI’s program doesn’t directly address how the company might exercise control over content teachers create using its services.

“If educators are creating courses and coursework on a program that gives the company the right to recreate and sell that data, that would destabilize a lot,” Tres Souhaits told TechCrunch. “It’s unclear to me how OpenAI will use, package, or sell whatever is generated by their models.”

In its ToS, OpenAI states that it doesn’t sell user data, and that users of its services, including ChatGPT, own the outputs they generate “to the extent permitted by applicable law.” Without additional assurances, however, Tres Souhaits isn’t convinced that OpenAI won’t quietly change its policies in the future.

“For me, AI is like crypto,” Tres Souhaits said. “It’s new, so it offers a lot of possibility — but it’s also so deregulated that I wonder how much I would trust any guarantee.”

Late last year, the United Nations Educational, Scientific, and Cultural Organization (UNESCO) pushed for governments to regulate the use of AI in education, including implementing age limits for users and guardrails on data protection and user privacy. But little progress has been made on those fronts since — and on AI policy in general.

Tres Souhaits also takes issue with the fact that OpenAI’s program, which OpenAI markets as a guide to “AI, generative AI, and ChatGPT,” doesn’t mention any AI tools besides OpenAI’s own. “It feels like this reinforces the idea that OpenAI is the AI company,” he said. “It’s a smart idea for OpenAI as a business. But we already have a problem with these tech-opolies — companies that have an outsize influence because, as the tech was developed, they put themselves at the center of innovation and made themselves synonymous with the thing itself.”

Josh Prieur, a classroom teacher-turned-product director at educational games company Prodigy Education, had a more upbeat take on OpenAI’s educator outreach. Prieur argues that there are “clear upsides” for teachers if school systems adopt AI in a “thoughtful” and “responsible” way, and he believes that OpenAI’s program is transparent about the risks.

“There remain concerns from teachers around using AI to plagiarize content and dehumanize the learning experience, and also risks around becoming overly reliant on AI,” Prieur said. “But education is often key to overcoming fears around the adoption of new technology in schools, while also ensuring the right safeguards are in place to ensure students are protected and teachers remain in full control.”

OpenAI is aggressively going after the education market, which it sees as a key area of growth.

In September, OpenAI hired former Coursera chief revenue officer Leah Belsky as its first GM of education, and charged her with bringing OpenAI’s products to more schools. And in the spring, the company launched ChatGPT Edu, a version of ChatGPT built for universities.

According to Allied Market Research, the AI in education market could be worth $88.2 billion within the next decade. But growth is off to a sluggish start, in large part thanks to skeptical pedagogues.

In a survey this year by the Pew Research Center, a quarter of public K-12 teachers said that using AI tools in education does more harm than good. A separate poll by the Rand Corporation and the Center on Reinventing Public Education found that just 18% of K-12 educators are using AI in their classrooms.

Educational leaders have been similarly reluctant to try AI themselves, or introduce the technology to the educators they oversee. Per educational consulting firm EAB, few district superintendents view addressing AI as a “very urgent” need this year — particularly in light of pressing issues such as understaffing and chronic absenteeism.

Mixed research on AI’s educational impact hasn’t helped convince the non-believers. University of Pennsylvania researchers found that Turkish high school students with access to ChatGPT did worse on a math test than students who didn’t have access. In a separate study, researchers observed that German students using ChatGPT were able to find research materials more easily, but tended to synthesize those materials less skillfully than their non-ChatGPT-using peers.

As OpenAI writes in its guide, ChatGPT isn’t a substitute for engagement with students. Some educators and schools may never be convinced it’s a substitute for any step in the teaching process.

Keep reading the article on TechCrunch

