Skip to content
OnMSFT.com
  • Home
  • About
  • Contact
  • Windows
  • Surface
  • Xbox
  • How-To
  • OnPodcast
  • Edge
  • Teams
  • Gaming
Menu
  • Home
  • About
  • Contact
  • Windows
  • Surface
  • Xbox
  • How-To
  • OnPodcast
  • Edge
  • Teams
  • Gaming
  1. Home
  2. News
  3. Tech giants turn to synthetic data to power advanced AI models

Tech giants turn to synthetic data to power advanced AI models

OnMSFT Staff OnMSFT Staff
July 19, 2023
2 min read

As reported by Financial Times, AI companies are exploring a new approach to obtain data for powerful generative models: generating information from scratch using synthetic data. Microsoft, OpenAI, and Cohere are among those employing synthetic data—computer-generated information—to train their large language models (LLMs) due to limitations in human-made data.

The launch of Microsoft-backed OpenAI’s ChatGPT has led to various products that generate plausible text, images, or code based on simple prompts. Generative AI has attracted significant interest, with tech giants like Google, Microsoft, and Meta competing.

LLMs powering chatbots like ChatGPT and Google’s Bard primarily rely on web scraping techniques to accumulate data from books, articles, social media, videos, and more.

However, as generative AI software becomes increasingly sophisticated, AI companies face data access and privacy concerns challenges. Synthetic data offers a solution by being cost-effective.

Cohere and competitors use synthetic data generated by AI models and fine-tuned by humans. For example, Cohere might use two AI models simulating a conversation between a math tutor and a student to train a model on advanced mathematics.

Recent research from Microsoft shows synthetic data can effectively train smaller, simpler models. One instance involved a synthetic dataset of short stories generated by GPT-4, which trained a simple LLM to produce coherent and grammatically correct stories.

Startups like Scale AI and Gretel.ai offer synthetic data services, preserving privacy and removing biases. Synthetic data helps financial institutions examine fraud scenarios and other applications.

Critics warn using AI-generated raw data could degrade the technology over time with falsehoods. Nevertheless, AI researchers see synthetic data as a path to superintelligent AI that can create knowledge and ask questions.

Related

Share this article:
Previous Article Google Chrome users now receiving Bing Chat invitations Next Article 22 House Representatives petition FTC to stop opposing Microsoft’s Activision merger

Related Articles

Chrome is adding a scanning animation and ‘Checking download for safety’ message for risky downloads

March 21, 2026

Microsoft says Windows 11 will get faster, quieter, and more customizable

March 20, 2026
Intel says it is listening to feedback as LGA-1954 may support multiple CPU generations including Nova Lake and future desktop processors.

LGA-1954 Could Bring Multi-Generation Support to Intel CPUs

March 20, 2026

Leave a Comment Cancel reply

Your email address will not be published. Required fields are marked *

Recent Posts

  • Chrome is adding a scanning animation and ‘Checking download for safety’ message for risky downloads
  • Microsoft says Windows 11 will get faster, quieter, and more customizable
  • LGA-1954 Could Bring Multi-Generation Support to Intel CPUs
  • Intel Says It Tried to Help Before Crimson Desert Dropped Arc Support
  • Three Charged for Illegally Sending Nvidia AI Servers to China

Recent Comments

No comments to show.
OnMSFT.com

The Tech News Site

Categories

  • Windows
  • Surface
  • Xbox
  • How-To
  • OnPodcast
  • Gaming
  • Edge
  • Teams

Recent Posts

  • Chrome is adding a scanning animation and 'Checking download for safety' message for risky downloads
  • Microsoft says Windows 11 will get faster, quieter, and more customizable
  • LGA-1954 Could Bring Multi-Generation Support to Intel CPUs
  • Intel Says It Tried to Help Before Crimson Desert Dropped Arc Support
  • Three Charged for Illegally Sending Nvidia AI Servers to China

Quick Links

  • About OnMSFT.com
  • Contact OnMSFT
  • Join Our Team
  • Privacy Policy
© 2010–2026 OnMSFT.com LLC. All rights reserved.
About OnMSFT.comContact OnMSFTPrivacy Policy