Liwaiwai Liwaiwai



Liwaiwai Liwaiwai
  • /
  • Artificial Intelligence
  • Machine Learning
  • Robotics
  • Engineering
    • Architecture
    • Design
    • Software
    • Hybrid Cloud
    • Data
  • About
  • Artificial Intelligence
  • Data
  • Data Science
  • Machine Learning
  • Technology

Data Labeling for AI Products: How to Process Thousands of Data Labels

  • November 27, 2021
  • Aelia Vita

Today, the AI industry is on the rise – the global AI market is valued at over $40 billion and continues to grow at a rapid pace. The industry relies on three vital components for its development: (1) the hardware that provides computational power and virtual living space for Machine Learning (ML) algorithms; (2) the actual ML algorithms, which is code written by data scientists to instruct the machine on how to teach itself; and finally (3) the data that needs to be labelled and fed into the machine along with any accompanying training instructions, so that it has some substance to fall back onto. Without the data, the first two components can do as much as an automobile can without a full tank – absolutely nothing.

As a result, data labeling has emerged as one of the bedrocks of the current AI boom, a statement supported by Gartner’s Hype Cycle Report for Data Science and ML.

Importantly, whereas both hardware solutions and training algorithms are obtainable commodities that are more or less the same among data scientists, the data annotation component still varies significantly. Different data-labeling approaches and methodologies exist, each one with its fortes and foibles.

Importance of data

The truth is that no matter the labeling approach, whoever can get their hands on more data and whoever can label this data faster, cheaper, and more accurately – has a competitive advantage. This is why the data-labeling market is also growing, in fact at almost 30% per year. Some sources go as far as to predict that the global data-labeling market will exceed $8 billion by 2030. More is yet to come.

Below are a handful of recent case studies that show the power of data labeling in action.

Case study 1: Neatsy / content collection

Content collection can take many shapes – from voice assistants, chatbots, and web content extraction to field tasks for retail. In the case of Neatsy, an app that takes your measurements and finds you the right shoe size, it was about collecting 50,000 images of human feet.

The app’s main feature is a 3D scanner that creates a 3D model of the customer’s foot and then uses it to recommend the best fitting shoes. The scanner works as a sophisticated neural network – its main task is to successfully separate human feet from the floor. Initially, the company built its first MVP using around 1,000 labeled images. When Neatsy’s team realized that for their app to work they would need 50 times as many labeled images, they turned to AI and had all of the images labeled within a week.

Neatsy then requested an advanced assignment that required more precise foot outlines. This involved creating a special training pool for performers with a follow-up competency test. Another three weeks later, the company received all 50,000 labeled images.

Neatsy noted that the app’s time to market was accelerated significantly and their 3D scanner became 12% more accurate.

Neatsy: an app that takes your measurements and finds you the right shoe size

Case study 2: Toloka / brand identity and decision-making

SMM teams need to engage and understand their target audience. There are a number of ways to do it: intent classification, sentiment analysis, utterance collection, and surveys.

One of the examples of this is when a company is in need of rebranding as it aims to become a globally recognizable business with its own corporate identity. As was the case when our team went through our own rebranding earlier this year, agreeing on the right design can be challenging. Especially when you have more than one design variant on your hands and more than one voice within your team, which often is the case.

READ MORE: [button style='accent' url='https://liwaiwai.com/2020/08/31/how-the-shady-world-of-the-data-industry-strips-away-our-freedoms/' target='_blank' arrow='true' fullwidth='true']HOW THE SHADY WORLD OF THE DATA INDUSTRY STRIPS AWAY OUR FREEDOMS[/button]

[button style='accent' url='https://liwaiwai.com/2020/08/13/data-systems-that-learn-to-be-better/' target='_blank' arrow='true' fullwidth='true']DATA SYSTEMS THAT LEARN TO BE BETTER[/button]

With crowdsourcing, for example, if you have multiple versions of your logo or other imagery, you can quickly engage with your audience and check in advance what your future customers think about your new corporate identity.

All it takes is getting the crowd to do a simple side-by-side comparison and rank your options from best to worse with detailed explanations, should they be required. This endeavour can take as little as 10 minutes.

When it comes to rebranding, agreeing on the right design can be challenging

Case study 3: Handl and data extraction

Y-Combinator backed Handl is an algorithm-based solution that helps large companies analyze, categorize, and retrieve customer information from scanned documents in a matter of seconds.

The problem is that for Handl to be able to offer reliable results and maintain a competitive edge, the algorithm needs to be updated and retrained on a regular basis. This means lots of incoming data labeled consistently and accurately.

Handl used crowdsourced approach to data labeling to its advantage – the company now collects over 4500+ daily tasks, with each new document requiring 6-8 minutes to process, and all that at an affordable cost that’s 50% lower than the in-house route.

Handl collects over 4500+ daily tasks

Takeaway: AI Data Labeling

In today’s world, ML products and data-driven business decisions require a stable, fast, and reliable flow of high-quality data. Having a wide range of options—both unsupervised and supervised—to support your business by ensuring transparent production pipelines, high labeling accuracy, and rapid delivery is crucial.

This feature was originally republished from hackernoon.

Aelia Vita

Related Topics
  • AI
  • Algorithm
  • Brand
  • Data
  • Data Extraction
You May Also Like
View Post
  • Artificial Intelligence

Microsoft‘s Big AI Ambitions Go Beyond Just OpenAI And ChatGPT

  • February 3, 2023
View Post
  • Artificial Intelligence
  • Technology

Deepfakes: Faces Created By AI Now Look More Real Than Genuine photos

  • February 3, 2023
View Post
  • Artificial Intelligence

GPT-3 In Your Pocket? Why Not!

  • February 3, 2023
View Post
  • Artificial Intelligence
  • Design
  • Engineering

Can AI Replace Cloud Architects?

  • February 2, 2023
View Post
  • Artificial Intelligence

Meet Aiko And Aiden: The World’s First AI Interns

  • February 2, 2023
View Post
  • Artificial Intelligence
  • Technology

Google Scrambles To Catch Up In The Wake Of OpenAI’s ChatGPT

  • January 31, 2023
View Post
  • Artificial Intelligence
  • Technology

9 Ways We Use AI In Our Products

  • January 31, 2023
View Post
  • Technology
  • Tools

Google Cloud Unveils New AI Tools for Retailers

  • January 31, 2023
Stay Connected!
LATEST
  • 1
    Microsoft‘s Big AI Ambitions Go Beyond Just OpenAI And ChatGPT
    • February 3, 2023
  • 2
    Deepfakes: Faces Created By AI Now Look More Real Than Genuine photos
    • February 3, 2023
  • 3
    GPT-3 In Your Pocket? Why Not!
    • February 3, 2023
  • 4
    Can AI Replace Cloud Architects?
    • February 2, 2023
  • 5
    Meet Aiko And Aiden: The World’s First AI Interns
    • February 2, 2023
  • 6
    Google Scrambles To Catch Up In The Wake Of OpenAI’s ChatGPT
    • January 31, 2023
  • 7
    9 Ways We Use AI In Our Products
    • January 31, 2023
  • 8
    Google Cloud Unveils New AI Tools for Retailers
    • January 31, 2023
  • 9
    7 Ways Google Is Using AI To Help Solve Society’s Challenges
    • January 30, 2023
  • 10
    The Ethics Of Machine Learning: Understanding The Role Of Developers And Designers
    • January 30, 2023

about
About
Hello World!

We are liwaiwai.com. Created by programmers for programmers.

Our site aims to provide materials, guides, programming how-tos, and resources relating to artificial intelligence, machine learning and the likes.

We would like to hear from you.

If you have any questions, enquiries or would like to sponsor content, kindly reach out to us at:

[email protected]

Live long & prosper!
Most Popular
  • 1
    8 Best Human Behaviour Datasets For Machine Learning
    • January 30, 2023
  • 2
    Built With BigQuery: How To Accelerate Data-Centric AI Development With Google Cloud And Snorkel AI
    • January 29, 2023
  • 3
    What Kind Of Future Will AI Bring Enterprise IT?
    • January 29, 2023
  • 4
    Prompt Engineering For ChatGPT And Generative AI
    • January 29, 2023
  • 5
    AI Might Be Seemingly Everywhere, But There Are Still Plenty Of Things It Can’t Do—for now
    • January 27, 2023
  • /
  • Artificial Intelligence
  • Machine Learning
  • Robotics
  • Engineering
  • About

Input your search keywords and press Enter.