
Alibaba Supports Digitization Of Chinese Ancient Books With Advanced AI Technology

  • May 19, 2021
  • relay

Digitizing Chinese classics is challenging because ancient Chinese characters are complex. Over the course of history, a single Chinese character may have acquired several variants and written forms. Digitizing ancient Chinese books through optical character recognition (OCR) not only makes them machine-readable but also gives new life to numerous ancient books by opening them to public perusal.

Alibaba DAMO Academy (DAMO), the global research institute of Alibaba, has started a new project to digitize Chinese classics together with the Alibaba Foundation, the University of California, Berkeley Library, Sichuan University, the National Library of China, and Zhejiang Library. The program aims to digitize and aggregate ancient Chinese books and to convert the scanned images into text for open access. This way, libraries in China and abroad can work together to make their ancient Chinese books freely available to the world.

Jeff Zhang, Head of Alibaba DAMO Academy, said: “Alibaba will continue to invest in resources and cutting-edge technology to support such projects. Making ancient books available to the public is in line with our values and belief in ‘Tech for Change’. We believe that technology can play a critical role in preserving precious cultural relics and heritage, and we look forward to working with libraries in China and abroad to make this happen.”

The first batch of Chinese classics in this joint effort comes from the C.V. Starr East Asian Library at the University of California, Berkeley, one of the largest academic libraries with rich holdings of ancient Chinese books. A total of 200,000 digital pages of ancient books are now on display, including woodblock-printed books and manuscripts from the Song and Yuan dynasties, a period of Chinese history reaching back more than 1,000 years. Other materials include digital pages of an original volume of Siku Quanshu (四库全书), the Complete Works of Chinese Classics from the Qing Dynasty.

UC Berkeley Library provided the scanned pages and metadata, while DAMO used OCR to turn the scanned images into text. DAMO also teamed up with scholars at Sichuan University to develop an AI model that combines single-character indexing and automatic character grouping with machine learning techniques such as self-supervised learning and few-shot learning. The model recognizes ancient characters with an accuracy rate of 97.5%. It can now recognize 30,000 ancient Chinese characters and does so efficiently, at thirty times the speed of human reading.
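
The article does not say how the 97.5% figure was measured. For readers who want a feel for what such a number can mean, here is a minimal sketch of one common way to score OCR output: character-level accuracy based on edit distance against a hand-checked transcription. The function names and the sample strings are illustrative assumptions, not part of DAMO's system.

```python
def edit_distance(reference: str, hypothesis: str) -> int:
    """Levenshtein distance between two strings, counted in characters."""
    # previous[j] holds the distance between the reference prefix processed
    # so far and the first j characters of the hypothesis.
    previous = list(range(len(hypothesis) + 1))
    for i, ref_char in enumerate(reference, start=1):
        current = [i]
        for j, hyp_char in enumerate(hypothesis, start=1):
            substitution = previous[j - 1] + (ref_char != hyp_char)
            insertion = current[j - 1] + 1
            deletion = previous[j] + 1
            current.append(min(substitution, insertion, deletion))
        previous = current
    return previous[-1]


def character_accuracy(reference: str, hypothesis: str) -> float:
    """Assumed metric: 1 - (edit distance / reference length), floored at 0."""
    if not reference:
        raise ValueError("reference must not be empty")
    errors = edit_distance(reference, hypothesis)
    return max(0.0, 1.0 - errors / len(reference))


# Hypothetical example: the OCR output returns the simplified variant 黄
# where the hand-checked transcription has the traditional form 黃.
reference = "天地玄黃宇宙洪荒"
ocr_output = "天地玄黄宇宙洪荒"
print(character_accuracy(reference, ocr_output))  # 0.875
```

In this sketch a variant form counts as an error. A real evaluation would have to decide whether variants of the same character should be treated as matches, which is one reason the many written forms of ancient characters make the task hard.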

“Alibaba will make this AI system for the machine-reading of Chinese ancient books available to the public soon,” Zhang added.
