Live Industry Updates

AI News and Blog Articles

Curated updates from the most trusted sources in artificial intelligence. Stay ahead without the noise.

Top AI News

Hand-picked stories worth reading right now

183 articles found · page 2 of 2

🤖
OpenAI
Dec 18, 2025

Updating our Model Spec with teen protections

OpenAI is updating its Model Spec with new Under-18 Principles that define how ChatGPT should support teens with safe, age-appropriate guidance grounded in developmental science. The update strengthens guardrails, clarifies expected model behavior in higher-risk situations, and builds on our broader work to improve teen safety across ChatGPT.

Product UpdatesRead Summary
🤖
OpenAI
Dec 17, 2025

Introducing OpenAI Academy for News Organizations

OpenAI is launching the OpenAI Academy for News Organizations, a new learning hub built with the American Journalism Project and The Lenfest Institute to help newsrooms use AI effectively. The Academy offers training, practical use cases, and responsible-use guidance to support journalists, editors, and publishers as they adopt AI in their reporting and operations.

Product UpdatesRead Summary
🤖
OpenAI
Dec 12, 2025

BBVA and OpenAI collaborate to transform global banking

BBVA is expanding its work with OpenAI through a multi-year AI transformation program, rolling out ChatGPT Enterprise to all 120,000 employees. Together, the companies will develop AI solutions that enhance customer interactions, streamline operations, and help build an AI-native banking experience.

Read Summary
🤖
OpenAI
Dec 11, 2025

Advancing science and math with GPT-5.2

GPT-5.2 is OpenAI's strongest model yet for math and science, setting new state-of-the-art results on benchmarks like GPQA Diamond and FrontierMath. This post shows how those gains translate into real research progress, including solving an open theoretical problem and generating reliable mathematical proofs.

ResearchRead Summary
🤖
OpenAI
Dec 11, 2025

Update to GPT-5 System Card: GPT-5.2

GPT-5.2 is the latest model family in the GPT-5 series. The comprehensive safety mitigation approach for these models is largely the same as that described in the GPT-5 System Card and GPT-5.1 System Card. Like OpenAI's other models, the GPT-5.2 models were trained on diverse datasets, including information that is publicly available on the internet, information that we partner with third parties to access, and information that our users or human trainers and researchers provide or generate.

Product UpdatesRead Summary
🤖
OpenAI
Nov 20, 2025

OpenAI and Foxconn collaborate to strengthen U.S. manufacturing across the AI supply chain

OpenAI and Foxconn are collaborating to design and manufacture next-generation AI infrastructure hardware in the U.S. The partnership will develop multiple generations of data-center systems, strengthen U.S. supply chains, and build key components domestically to accelerate advanced AI infrastructure.

General AIRead Summary
🤖
OpenAI
Oct 29, 2025

gpt-oss-safeguard technical report

gpt-oss-safeguard-120b and gpt-oss-safeguard-20b are two open-weight reasoning models post-trained from the gpt-oss models and trained to reason from a provided policy in order to label content under that policy. In this report, we describe gpt-oss-safeguard's capabilities and provide our baseline safety evaluations on the gpt-oss-safeguard models, using the underlying gpt-oss models as a baseline. For more information about the development and architecture of the underlying gpt-oss models, see the original gpt-oss model model card⁠.

Funding & InvestmentRead Summary
💡
Google DeepMind
Oct 24, 2025

Advanced version of Gemini with Deep Think officially achieves gold-medal standard at the International Mathematical Olympiad

The International Mathematical Olympiad ("IMO") is the world's most prestigious competition for young mathematicians, and has been held annually since 1959. Each country taking part is represented by six elite, pre-university mathematicians who compete to solve six exceptionally difficult problems in algebra, combinatorics, geometry, and number theory.

General AIRead Summary
🤖
OpenAI
Sep 30, 2025

Sora 2 System Card

Sora 2 is our new state of the art video and audio generation model. Building on the foundation of Sora, this new model introduces capabilities that have been difficult for prior video models to achieve- such as more accurate physics, sharper realism, synchronized audio, enhanced steerability, and an expanded stylistic range.

Product UpdatesRead Summary
🤖
OpenAI
Sep 15, 2025

Addendum to GPT-5 system card: GPT-5-Codex

This addendum to the GPT-5 system card shares a new model: GPT-5-Codex, a version of GPT-5 further optimized for agentic coding in Codex. GPT-5-Codex adjusts its thinking effort more dynamically based on task complexity, responding quickly to simple conversational queries or small tasks, while independently working for longer on more complex tasks.

AI AutomationRead Summary
🤖
OpenAI
Aug 5, 2025

Open Weights and AI for All

AI's next frontier isn't just about capability-it's about who gets to use it. Our mission to put AI in the hands of as many people as possible is what drives us. Today's release of our most capable open-weights models is a major step forward that makes advanced AI more open, flexible, and accessible worldwide.

General AIRead Summary
🤖
OpenAI
Aug 5, 2025

Introducing gpt-oss

We're releasing gpt-oss-120b and gpt-oss-20b-two state-of-the-art open-weight language models that deliver strong real-world performance at low cost. Available under the flexible Apache 2.0 license, these models outperform similarly sized open models on reasoning tasks, demonstrate strong tool use capabilities, and are optimized for efficient deployment on consumer hardware.

Product UpdatesRead Summary
🤖
OpenAI
Jul 22, 2025

Stargate advances with 4.5 GW partnership with Oracle

Oracle and OpenAI have entered an agreement to develop 4.5 gigawatts of additional Stargate data center capacity in the U.S. This investment will create new jobs, accelerate America's reindustrialization, and help advance U.S. AI leadership. It also marks a major milestone for Stargate, OpenAI's AI infrastructure platform and long-term vision to deliver the benefits of AI to everyone.

General AIRead Summary
🤖
OpenAI
Jul 21, 2025

AI as the greatest source of empowerment for all

I've always considered myself a pragmatic technologist-someone who loves technology not for its own sake, but for the direct impact it can have on people's lives. That's what makes this job so exciting, since I believe AI will unlock more opportunities for more people than any other technology in history. If we get this right, AI can give everyone more power than ever.

General AIRead Summary
🤖
OpenAI
Jul 17, 2025

OpenAI nonprofit jam

At OpenAI, we build tools to help people solve hard problems-including nonprofits working on the frontlines of their communities. The OpenAI Academy is teaming up with the Walton Family Foundation, Emerson Collective, and a network of local nonprofit organizations to host the Nonprofit Jam-a one-day, nationwide event bringing together more than 1,000 nonprofit leaders across 10 locations.

Read Summary
🤖
OpenAI
Jun 30, 2025

AI in Australia-OpenAI's Economic Blueprint

Today, OpenAI, in partnership with Mandala Partners, is sharing the OpenAI AI Economic Blueprint for Australia. At a time when boosting productivity has emerged as a national priority for Australia, the Blueprint provides a clear, actionable plan for how Australia can unlock the full economic and social potential of artificial intelligence.

General AIRead Summary
🤖
OpenAI
May 16, 2025

Addendum to o3 and o4-mini system card: Codex

Codex is a cloud-based coding agent. Codex is powered by codex-1, a version of OpenAI o3 optimized for software engineering. codex-1 was trained using reinforcement learning on real-world coding tasks in a variety of environments to generate code that closely mirrors human style and PR preferences, adheres precisely to instructions, and iteratively runs tests until passing results are achieved.

General AIRead Summary
🤖
OpenAI
May 6, 2025

Introducing AI stories: daily benefits shine a light on bigger opportunities

Sam Altman has written that we are entering the Intelligence Age, a time when AI will help people become dramatically more capable. The biggest problems of today-across science, medicine, education, national defense-will no longer seem intractable, but will in fact be solvable. New horizons of possibility and prosperity will open up.

Product UpdatesRead Summary
🤖
OpenAI
Jan 23, 2025

Operator System Card

Drawing from OpenAI's established safety frameworks, this document highlights our multi-layered approach, including model and product mitigations we've implemented to protect against prompt engineering and jailbreaks, protect privacy and security, as well as details our external red teaming efforts, safety evaluations, and ongoing work to further refine these safeguards.

Funding & InvestmentRead Summary
🤖
OpenAI
Jan 21, 2025

Stargate Infrastructure

OpenAI, and our strategic partners, are thrilled about our shared vision for the Infrastructure of AGI. We are energized by the challenges we face and are excited by the prospect of partnering with firms across the industrial base to deliver against our ambitious mission. Specifically, we want to connect with firms across the built data center infrastructure landscape, from power and land to construction to equipment, and everything in between.

General AIRead Summary
🤖
OpenAI
Jun 18, 2024

Surging developer productivity with custom GPTs

Paf adopted ChatGPT Enterprise across its entire company, with engineers using custom GPTs on a daily basis to speed up routine development tasks. Paf also integrated ChatGPT Enterprise into the grit:lab coding academy (gritlab.ax), training the next generation of software developers using an AI-augmented, systems-architecture mindset from day one. In addition to the wide range of use cases for developers and grit:lab students, 70% of Paf employees actively use ChatGPT Enterprise, spanning business teams like finance, HR, marketing, and customer support.

Read Summary
🤖
OpenAI
Jun 17, 2024

Using GPT-4o reasoning to transform cancer care

Color Health is working with OpenAI to pioneer a new way of accelerating cancer patients' access to treatment. Their new Cancer Copilot application uses GPT-4o to identify missing diagnostics and create tailored workup plans, enabling healthcare providers to make evidence-based decisions about cancer screening and treatment.

General AIRead Summary
🤖
OpenAI
May 29, 2024

Enhancing news in ChatGPT with The Atlantic

The Atlantic is announcing a strategic content and product partnership with OpenAI, which positions The Atlantic as a premium news source within OpenAI. The Atlantic's articles will be discoverable within OpenAI's products, including ChatGPT, and as a partner, The Atlantic will help to shape how news is surfaced and presented in future real-time discovery products.

Product UpdatesRead Summary
🤖
OpenAI
May 16, 2024

Creating an AI-powered Magic Studio

Canva is a visual communication platform, enjoyed by more than 175 million people monthly to make presentations, videos, documents, websites, social media graphics and more. A majority of the world's knowledge workers lack design training, but Canva's combination of an easy-to-use interface, vast libraries, and time-saving tools allows anyone to create visually compelling content.

General AIRead Summary
🤖
OpenAI
Feb 15, 2024

Video generation models as world simulators

We explore large-scale training of generative models on video data. Specifically, we train text-conditional diffusion models jointly on videos and images of variable durations, resolutions and aspect ratios. We leverage a transformer architecture that operates on spacetime patches of video and image latent codes. Our largest model, Sora, is capable of generating a minute of high fidelity video. Our results suggest that scaling video generation models is a promising path towards building general purpose simulators of the physical world.

General AIRead Summary
🤖
OpenAI
Jan 31, 2024

Building an early warning system for LLM-aided biological threat creation

We're developing a blueprint for evaluating the risk that a large language model (LLM) could aid someone in creating a biological threat. In an evaluation involving both biology experts and students, we found that GPT-4 provides at most a mild uplift in biological threat creation accuracy. While this uplift is not large enough to be conclusive, our finding is a starting point for continued research and community deliberation.

Funding & InvestmentRead Summary
🤖
OpenAI
May 31, 2023

Improving mathematical reasoning with process supervision

We've trained a model to achieve a new state-of-the-art in mathematical problem solving by rewarding each correct step of reasoning ("process supervision") instead of simply rewarding the correct final answer ("outcome supervision"). In addition to boosting performance relative to outcome supervision, process supervision also has an important alignment benefit: it directly trains the model to produce a chain-of-thought that is endorsed by humans.

AI SafetyRead Summary
🤖
OpenAI
Mar 14, 2023

GPT-4

We've created GPT-4, the latest milestone in OpenAI's effort in scaling up deep learning. GPT-4 is a large multimodal model (accepting image and text inputs, emitting text outputs) that, while less capable than humans in many real-world scenarios, exhibits human-level performance on various professional and academic benchmarks.

ResearchRead Summary
🤖
OpenAI
Jan 11, 2023

Forecasting potential misuses of language models for disinformation campaigns and how to reduce risk

OpenAI researchers collaborated with Georgetown University's Center for Security and Emerging Technology and the Stanford Internet Observatory to investigate how large language models might be misused for disinformation purposes. The collaboration included an October 2021 workshop bringing together 30 disinformation researchers, machine learning experts, and policy analysts, and culminated in a co-authored report building on more than a year of research. This report outlines the threats that language models pose to the information environment if used to augment disinformation campaigns and int

ResearchRead Summary
🤖
OpenAI
Jul 14, 2022

DALL·E 2: Extending creativity

As part of our DALL·E 2 research preview, more than 3,000 artists from more than 118 countries have incorporated DALL·E into their creative workflows. The artists in our early access group have helped us discover new uses for DALL·E and have served as key voices as we've made decisions about DALL·E's features.

ResearchRead Summary
🤖
OpenAI
Jun 23, 2022

Learning to play Minecraft with Video PreTraining

We trained a neural network to play Minecraft by Video PreTraining (VPT) on a massive unlabeled video dataset of human Minecraft play, while using only a small amount of labeled contractor data. With fine-tuning, our model can learn to craft diamond tools, a task that usually takes proficient humans over 20 minutes (24,000 actions). Our model uses the native human interface of keypresses and mouse movements, making it quite general, and represents a step towards general computer-using agents.

ResearchRead Summary
🤖
OpenAI
Jun 13, 2022

AI-written critiques help humans notice flaws

We trained "critique-writing" models to describe flaws in summaries. Human evaluators find flaws in summaries much more often when shown our model's critiques. Larger models are better at self-critiquing, with scale improving critique-writing more than summary-writing. This shows promise for using AI systems to assist human supervision of AI systems on difficult tasks.

Funding & InvestmentRead Summary
🤖
OpenAI
Mar 4, 2021

Multimodal neurons in artificial neural networks

We've discovered neurons in CLIP that respond to the same concept whether presented literally, symbolically, or conceptually. This may explain CLIP's accuracy in classifying surprising visual renditions of concepts, and is also an important step toward understanding the associations and biases that CLIP and similar models learn.

General AIRead Summary
🤖
OpenAI
Jan 5, 2021

CLIP: Connecting text and images

We're introducing a neural network called CLIP which efficiently learns visual concepts from natural language supervision. CLIP can be applied to any visual classification benchmark by simply providing the names of the visual categories to be recognized, similar to the "zero-shot" capabilities of GPT-2 and GPT-3.

Product UpdatesRead Summary
🤖
OpenAI
Jun 17, 2020

Image GPT

We find that, just as a large transformer model trained on language can generate coherent text, the same exact model trained on pixel sequences can generate coherent image completions and samples. By establishing a correlation between sample quality and image classification accuracy, we show that our best generative model also contains features competitive with top convolutional nets in the unsupervised setting.

General AIRead Summary
🤖
OpenAI
May 5, 2020

AI and efficiency

We're releasing an analysis showing that since 2012 the amount of compute needed to train a neural net to the same performance on ImageNet classification has been decreasing by a factor of 2 every 16 months. Compared to 2012, it now takes 44 times less compute to train a neural network to the level of AlexNet (by contrast, Moore's Law would yield an 11x cost improvement over this period). Our results suggest that for AI tasks with high levels of recent investment, algorithmic progress has yielded more gains than classical hardware efficiency.

Read Summary
🤖
OpenAI
Apr 16, 2020

Improving verifiability in AI development

We've contributed to a multi-stakeholder report by 58 co-authors at 30 organizations, including the Centre for the Future of Intelligence, Mila, Schwartz Reisman Institute for Technology and Society, Center for Advanced Study in the Behavioral Sciences, and Center for Security and Emerging Technologies. This report describes 10 mechanisms to improve the verifiability of claims made about AI systems. Developers can use these tools to provide evidence that AI systems are safe, secure, fair, or privacy-preserving. Users, policymakers, and civil society can use these tools to evaluate AI developme

Funding & InvestmentRead Summary
🤖
OpenAI
Apr 14, 2020

OpenAI Microscope

We're introducing OpenAI Microscope, a collection of visualizations of every significant layer and neuron of eight vision "model organisms" which are often studied in interpretability. Microscope makes it easier to analyze the features that form inside these neural networks, and we hope it will help the research community as we move towards understanding these complicated systems.

Product UpdatesRead Summary
🤖
OpenAI
Dec 5, 2019

Deep double descent

We show that the double descent phenomenon occurs in CNNs, ResNets, and transformers: performance first improves, then gets worse, and then improves again with increasing model size, data size, or training time. This effect is often avoided through careful regularization. While this behavior appears to be fairly universal, we don't yet fully understand why it happens, and view further study of this phenomenon as an important research direction.

ResearchRead Summary
🤖
OpenAI
Nov 5, 2019

GPT-2: 1.5B release

As the final model release of GPT-2's staged release, we're releasing the largest version (1.5B parameters) of GPT-2 along with code and model weights to facilitate detection of outputs of GPT-2 models. While there have been larger language models released since August, we've continued with our original staged release plan in order to provide the community with a test case of a full staged release process. We hope that this test case will be useful to developers of future powerful models, and we're actively continuing the conversation with the AI community on responsible publication.

Product UpdatesRead Summary
🤖
OpenAI
Oct 15, 2019

Solving Rubik's Cube with a robot hand

We've trained a pair of neural networks to solve the Rubik's Cube with a human-like robot hand. The neural networks are trained entirely in simulation, using the same reinforcement learning code as OpenAI Five paired with a new technique called Automatic Domain Randomization (ADR). The system can handle situations it never saw during training, such as being prodded by a stuffed giraffe. This shows that reinforcement learning isn't just a tool for virtual tasks, but can solve physical-world problems requiring unprecedented dexterity.

General AIRead Summary
🤖
OpenAI
Sep 19, 2019

Fine-tuning GPT-2 from human preferences

We've fine-tuned the 774M parameter GPT-2 language model using human feedback for various tasks, successfully matching the preferences of the external human labelers, though those preferences did not always match our own. Specifically, for summarization tasks the labelers preferred sentences copied wholesale from the input (we'd only asked them to ensure accuracy), so our models learned to copy. Summarization required 60k human labels; simpler tasks which continue text in various styles required only 5k. Our motivation is to move safety techniques closer to the general task of "machines talkin

AI SafetyRead Summary
🤖
OpenAI
Sep 17, 2019

Emergent tool use from multi-agent interaction

We've observed agents discovering progressively more complex tool use while playing a simple game of hide-and-seek. Through training in our new simulated hide-and-seek environment, agents build a series of six distinct strategies and counterstrategies, some of which we did not know our environment supported. The self-supervised emergent complexity in this simple environment further suggests that multi-agent co-adaptation may one day produce extremely complex and intelligent behavior.

General AIRead Summary
🤖
OpenAI
Aug 22, 2019

Testing robustness against unforeseen adversaries

We've developed a method to assess whether a neural network classifier can reliably defend against adversarial attacks not seen during training. Our method yields a new metric, UAR (Unforeseen Attack Robustness), which evaluates the robustness of a single model against an unanticipated attack, and highlights the need to measure performance across a more diverse range of unforeseen attacks.

Funding & InvestmentRead Summary
🤖
OpenAI
Aug 20, 2019

GPT-2: 6-month follow-up

We're releasing the 774 million parameter GPT-2 language model after the release of our small 124M model in February, staged release of our medium 355M model in May, and subsequent research with partners and the AI community into the model's potential for misuse and societal benefit. We're also releasing an open-source legal agreement to make it easier for organizations to initiate model-sharing partnerships with each other, and are publishing a technical report about our experience in coordinating with the wider AI research community on publication norms.

ResearchRead Summary
🤖
OpenAI
Jul 22, 2019

Microsoft invests in and partners with OpenAI to support us building beneficial AGI

Microsoft is investing $1 billion in OpenAI to support us building artificial general intelligence (AGI) with widely distributed economic benefits. We're partnering to develop a hardware and software platform within Microsoft Azure which will scale to AGI. We'll jointly develop new Azure AI supercomputing technologies, and Microsoft will become our exclusive cloud provider-so we'll be working hard together to further extend Microsoft Azure's capabilities in large-scale AI systems.

General AIRead Summary
🤖
OpenAI
Jul 10, 2019

Why responsible AI development needs cooperation on safety

We've written a policy research paper identifying four strategies that can be used today to improve the likelihood of long-term industry cooperation on safety norms in AI: communicating risks and benefits, technical collaboration, increased transparency, and incentivizing standards. Our analysis shows that industry cooperation on safety will be instrumental in ensuring that AI systems are safe and beneficial, but competitive pressures could lead to a collective action problem, potentially causing AI companies to under-invest in safety. We hope these strategies will encourage greater cooperatio

ResearchRead Summary
🤖
OpenAI
Apr 25, 2019

MuseNet

We've created MuseNet, a deep neural network that can generate 4-minute musical compositions with 10 different instruments, and can combine styles from country to Mozart to the Beatles. MuseNet was not explicitly programmed with our understanding of music, but instead discovered patterns of harmony, rhythm, and style by learning to predict the next token in hundreds of thousands of MIDI files. MuseNet uses the same general-purpose unsupervised technology as GPT-2, a large-scale transformer model trained to predict the next token in a sequence, whether audio or text.

General AIRead Summary
🤖
OpenAI
Apr 15, 2019

OpenAI Five defeats Dota 2 world champions

OpenAI Five is the first AI to beat the world champions in an esports game, having won two back-to-back games versus the world champion Dota 2 team, OG, at Finals this weekend. Both OpenAI Five and DeepMind's AlphaStar had previously beaten good pros privately but lost their live pro matches, making this also the first time an AI has beaten esports pros on livestream.

General AIRead Summary
🤖
OpenAI
Mar 21, 2019

Implicit generation and generalization methods for energy-based models

We've made progress towards stable and scalable training of energy-based models (EBMs) resulting in better sample quality and generalization ability than existing models. Generation in EBMs spends more compute to continually refine its answers and doing so can generate samples competitive with GANs at low temperatures, while also having mode coverage guarantees of likelihood-based models. We hope these findings stimulate further research into this promising class of models.

ResearchRead Summary
🤖
OpenAI
Mar 6, 2019

Introducing Activation Atlases

We've created activation atlases (in collaboration with Google researchers), a new technique for visualizing what interactions between neurons can represent. As AI systems are deployed in increasingly sensitive contexts, having a better understanding of their internal decision-making processes will let us identify weaknesses and investigate failures.

Product UpdatesRead Summary
🤖
OpenAI
Mar 4, 2019

Neural MMO: A massively multiagent game environment

We're releasing a Neural MMO, a massively multiagent game environment for reinforcement learning agents. Our platform supports a large, variable number of agents within a persistent and open-ended task. The inclusion of many agents and species leads to better exploration, divergent niche formation, and greater overall competence.

General AIRead Summary
🤖
OpenAI
Feb 19, 2019

AI safety needs social scientists

We've written a paper arguing that long-term AI safety research needs social scientists to ensure AI alignment algorithms succeed when actual humans are involved. Properly aligning advanced AI systems with human values requires resolving many uncertainties related to the psychology of human rationality, emotion, and biases. The aim of this paper is to spark further collaboration between machine learning and social science researchers, and we plan to hire social scientists to work on this full time at OpenAI.

ResearchRead Summary
🤖
OpenAI
Feb 14, 2019

Better language models and their implications

We've trained a large-scale unsupervised language model which generates coherent paragraphs of text, achieves state-of-the-art performance on many language modeling benchmarks, and performs rudimentary reading comprehension, machine translation, question answering, and summarization-all without task-specific training.

ResearchRead Summary
🤖
OpenAI
Dec 14, 2018

How AI training scales

We've discovered that the gradient noise scale, a simple statistical metric, predicts the parallelizability of neural network training on a wide range of tasks. Since complex tasks tend to have noisier gradients, increasingly large batch sizes are likely to become useful in the future, removing one potential limit to further growth of AI systems. More broadly, these results show that neural network training need not be considered a mysterious art, but can be rigorized and systematized.

General AIRead Summary
🤖
OpenAI
Dec 6, 2018

Quantifying generalization in reinforcement learning

We're releasing CoinRun, a training environment which provides a metric for an agent's ability to transfer its experience to novel situations and has already helped clarify a longstanding puzzle in reinforcement learning. CoinRun strikes a desirable balance in complexity: the environment is simpler than traditional platformer games like Sonic the Hedgehog but still poses a worthy generalization challenge for state of the art algorithms.

General AIRead Summary
🤖
OpenAI
Nov 7, 2018

Learning concepts with energy functions

We've developed an energy-based model that can quickly learn to identify and generate instances of concepts, such as near, above, between, closest, and furthest, expressed as sets of 2d points. Our model learns these concepts after only five demonstrations. We also show cross-domain transfer: we use concepts learned in a 2d particle environment to solve tasks on a 3-dimensional physics-based robot.

General AIRead Summary
🤖
OpenAI
Oct 22, 2018

Learning complex goals with iterated amplification

We're proposing an AI safety technique called iterated amplification that lets us specify complicated behaviors and goals that are beyond human scale, by demonstrating how to decompose a task into simpler sub-tasks, rather than by providing labeled data or a reward function. Although this idea is in its very early stages and we have only completed experiments on simple toy algorithmic domains, we've decided to present it in its preliminary state because we think it could prove to be a scalable approach to AI safety.

ResearchRead Summary
🤖
OpenAI
Jul 9, 2018

Glow: Better reversible generative models

We introduce Glow, a reversible generative model which uses invertible 1x1 convolutions. It extends previous work on reversible generative models and simplifies the architecture. Our model can generate realistic high resolution images, supports efficient sampling, and discovers features that can be used to manipulate attributes of data. We're releasing code for the model and an online visualization tool so people can explore and build on these results.

Product UpdatesRead Summary
🤖
OpenAI
Jul 4, 2018

Learning Montezuma's Revenge from a single demonstration

We've trained an agent to achieve a high score of 74,500 on Montezuma's Revenge from a single human demonstration, better than any previously published result. Our algorithm is simple: the agent plays a sequence of games starting from carefully chosen states from the demonstration, and learns from them by optimizing the game score using PPO, the same reinforcement learning algorithm that underpins OpenAI Five.

General AIRead Summary
🤖
OpenAI
Jun 11, 2018

Improving language understanding with unsupervised learning

We've obtained state-of-the-art results on a suite of diverse language tasks with a scalable, task-agnostic system, which we're also releasing. Our approach is a combination of two existing ideas: transformers and unsupervised pre-training. These results provide a convincing example that pairing supervised learning methods with unsupervised pre-training works very well; this is an idea that many have explored in the past, and we hope our result motivates further research into applying this idea on larger and more diverse datasets.

ResearchRead Summary
🤖
OpenAI
May 25, 2018

Gym Retro

We're releasing the full version of Gym Retro, a platform for reinforcement learning research on games. This brings our publicly-released game count from around 70 Atari games and 30 Sega games to over 1,000 games across a variety of backing emulators. We're also releasing the tool we use to add new games to the platform.

Product UpdatesRead Summary
🤖
OpenAI
May 16, 2018

AI and compute

We're releasing an analysis showing that since 2012, the amount of compute used in the largest AI training runs has been increasing exponentially with a 3.4-month doubling time (by comparison, Moore's Law had a 2-year doubling period)[^footnote-correction]. Since 2012, this metric has grown by more than 300,000x (a 2-year doubling period would yield only a 7x increase). Improvements in compute have been a key component of AI progress, so as long as this trend continues, it's worth preparing for the implications of systems far outside today's capabilities.

General AIRead Summary
🤖
OpenAI
Apr 18, 2018

Evolved Policy Gradients

We're releasing an experimental metalearning approach called Evolved Policy Gradients, a method that evolves the loss function of learning agents, which can enable fast training on novel tasks. Agents trained with EPG can succeed at basic tasks at test time that were outside their training regime, like learning to navigate to an object on a different side of the room from where it was placed during training.

ResearchRead Summary
🤖
OpenAI
Mar 7, 2018

Reptile: A scalable meta-learning algorithm

We've developed a simple meta-learning algorithm called Reptile which works by repeatedly sampling a task, performing stochastic gradient descent on it, and updating the initial parameters towards the final parameters learned on that task. Reptile is the application of the Shortest Descent algorithm to the meta-learning setting, and is mathematically similar to first-order MAML (which is a version of the well-known MAML algorithm) that only needs black-box access to an optimizer such as SGD or Adam, with similar computational efficiency and performance.

General AIRead Summary
🤖
OpenAI
Feb 26, 2018

Ingredients for robotics research

We're releasing eight simulated robotics environments and a Baselines implementation of Hindsight Experience Replay, all developed for our research over the past year. We've used these environments to train models which work on physical robots. We're also releasing a set of requests for robotics research.

ResearchRead Summary
🤖
OpenAI
Feb 20, 2018

Preparing for malicious uses of AI

We've co-authored a paper that forecasts how malicious actors could misuse AI technology, and potential ways we can prevent and mitigate these threats. This paper is the outcome of almost a year of sustained work with our colleagues at the Future of Humanity Institute, the Centre for the Study of Existential Risk, the Center for a New American Security, the Electronic Frontier Foundation, and others.

ResearchRead Summary
🤖
OpenAI
Feb 15, 2018

Interpretable machine learning through teaching

We've designed a method that encourages AIs to teach each other with examples that also make sense to humans. Our approach automatically selects the most informative examples to teach a concept-for instance, the best images to describe the concept of dogs-and experimentally we found our approach to be effective at teaching both AIs

ResearchRead Summary
🤖
OpenAI
Dec 6, 2017

Block-sparse GPU kernels

We're releasing highly-optimized GPU kernels for an underexplored class of neural network architectures: networks with block-sparse weights. Depending on the chosen sparsity, these kernels can run orders of magnitude faster than cuBLAS or cuSPARSE. We've used them to attain state-of-the-art results in text sentiment analysis and generative modeling of text and images.

General AIRead Summary
🤖
OpenAI
Oct 26, 2017

Learning a hierarchy

We've developed a hierarchical reinforcement learning algorithm that learns high-level actions useful for solving a range of tasks, allowing fast solving of tasks requiring thousands of timesteps. Our algorithm, when applied to a set of navigation problems, discovers a set of high-level actions for walking and crawling in different directions, which enables the agent to master new navigation tasks quickly.

General AIRead Summary
🤖
OpenAI
Oct 19, 2017

Generalizing from simulation

Our latest robotics techniques allow robot controllers, trained entirely in simulation and deployed on physical robots, to react to unplanned changes in the environment as they solve simple tasks. That is, we've used these techniques to build closed-loop systems rather than open-loop ones as before.

General AIRead Summary
🤖
OpenAI
Oct 11, 2017

Competitive self-play

We've found that self-play allows simulated AIs to discover physical skills like tackling, ducking, faking, kicking, catching, and diving for the ball, without explicitly designing an environment with these skills in mind. Self-play ensures that the environment is always the right difficulty for an AI to improve. Taken alongside our Dota 2 self-play results, we have increasing confidence that self-play will be a core part of powerful AI systems in the future.

General AIRead Summary
🤖
OpenAI
Sep 14, 2017

Learning to model other minds

We're releasing an algorithm which accounts for the fact that other agents are learning too, and discovers self-interested yet collaborative strategies like tit-for-tat in the iterated prisoner's dilemma. This algorithm, Learning with Opponent-Learning Awareness (LOLA), is a small step towards agents that model other minds.

General AIRead Summary
🤖
OpenAI
Aug 18, 2017

OpenAI Baselines: ACKTR & A2C

We're releasing two new OpenAI Baselines implementations: ACKTR and A2C. A2C is a synchronous, deterministic variant of Asynchronous Advantage Actor Critic (A3C) which we've found gives equal performance. ACKTR is a more sample-efficient reinforcement learning algorithm than TRPO and A2C, and requires only slightly more computation than A2C per update.

Product UpdatesRead Summary
🤖
OpenAI
Aug 16, 2017

More on Dota 2

Our Dota 2 result shows that self-play can catapult the performance of machine learning systems from far below human level to superhuman, given sufficient compute. In the span of a month, our system went from barely matching a high-ranked player to beating the top pros and has continued to improve since then. Supervised deep learning systems can only be as good as their training datasets, but in self-play systems, the available data improves automatically as the agent gets better.

ResearchRead Summary
🤖
OpenAI
Aug 11, 2017

Dota 2

We've created a bot which beats the world's top professionals at 1v1 matches of Dota 2 under standard tournament rules. The bot learned the game from scratch by self-play, and does not use imitation learning or tree search. This is a step towards building AI systems which accomplish well-defined goals in messy, complicated situations involving real humans.

General AIRead Summary
🤖
OpenAI
Aug 3, 2017

Gathering human feedback

RL-Teacher is an open-source implementation of our interface to train AIs via occasional human feedback rather than hand-crafted reward functions. The underlying technique was developed as a step towards safe AI systems, but also applies to reinforcement learning problems with rewards that are hard to specify.

General AIRead Summary
🤖
OpenAI
Jul 20, 2017

Proximal Policy Optimization

We're releasing a new class of reinforcement learning algorithms, Proximal Policy Optimization (PPO), which perform comparably or better than state-of-the-art approaches while being much simpler to implement and tune. PPO has become the default reinforcement learning algorithm at OpenAI because of its ease of use and good performance.

Regulation & PolicyRead Summary
🤖
OpenAI
Jun 13, 2017

Learning from human preferences

One step towards building safe AI systems is to remove the need for humans to write goal functions, since using a simple proxy for a complex goal, or getting the complex goal a bit wrong, can lead to undesirable and even dangerous behavior. In collaboration with DeepMind's safety team, we've developed an algorithm which can infer what humans want by being told which of two proposed behaviors is better.

AI SafetyRead Summary
🤖
OpenAI
Jun 8, 2017

Learning to cooperate, compete, and communicate

Multiagent environments where agents compete for resources are stepping stones on the path to AGI. Multiagent environments have two useful properties: first, there is a natural curriculum-the difficulty of the environment is determined by the skill of your competitors (and if you're competing against clones of yourself, the environment exactly matches your skill level). Second, a multiagent environment has no stable equilibrium: no matter how smart an agent is, there's always pressure to get smarter. These environments have a very different feel from traditional environments, and it'll take a

General AIRead Summary
🤖
OpenAI
Feb 24, 2017

Attacking machine learning with adversarial examples

Adversarial examples are inputs to machine learning models that an attacker has intentionally designed to cause the model to make a mistake; they're like optical illusions for machines. In this post we'll show how adversarial examples work across different mediums, and will discuss why securing systems against them can be difficult.

General AIRead Summary
🤖
OpenAI
Jun 16, 2016

Generative models

This post describes four projects that share a common theme of enhancing or using generative models, a branch of unsupervised learning techniques in machine learning. In addition to describing our work, this post will tell you a bit more about generative models: what they are, why they are important, and where they might be going.

General AIRead Summary
🤖
OpenAI
Dec 11, 2015

Introducing OpenAI

OpenAI is a non-profit artificial intelligence research company. Our goal is to advance digital intelligence in the way that is most likely to benefit humanity as a whole, unconstrained by a need to generate financial return. Since our research is free from financial obligations, we can better focus on a positive human impact.

Product UpdatesRead Summary