Most Viewed Content:

Ford will launch a new electric truck under a new product line

REcent report prove that Ford is planning on launching a new electric vehicle that will come with a new nameplate.

Samsung ships first 3nm Gate-All-Around chips to cryptocurrency mining industry customers as promised

This Monday, July 25, Korean electronics giant Samsung began...

Xiaomi 12S and 12S Pro Revealed With A New Processor

Xiaomi is now on a device launching spree as it is set to launch the Xiaomi 12S and 12s Pro with a new Snapdragon processor.

Xiaomi’s new technology: convert the table in the picture into an Excel file

Lei Jun, the founder of Xiaomi, introduced a set of table recognition algorithms developed by Xiaomi, which efficiently and accurately converts tables in pictures into editable Excel files, significantly improving the user experience. Table recognition refers to the recognition of table structure and text information in pictures into a data format that can be understood by computers, which has wide practical value in office, business and education scenarios, and has been a hot issue in document analysis research.

Around this problem, Xiaomi has developed a set of table recognition algorithms, which efficiently and accurately extracts the tables in pictures and converts them into editable Excel files. The algorithm has been successfully implemented in Xiaomi 10S series, MIX Fold 2, and other flagship models, you can access the experience from Album – More – Table Recognition, or swipe to enter.

Form detection algorithm

Xiaomi said that the table detection algorithm mainly extracts the table area accurately from the picture and corrects the table to get a flat table picture for the next step of table recognition.

The table recognition algorithm mainly extracts the table structure and table text content from the picture, and then combines these information effectively to output an editable Excel table.

Form detection has the following difficulties: on the one hand, the algorithm and memory on the cell phone are limited, on the other hand, the requirements for the form detection results are very high, the form often contains other text around it, and if the detection results are not accurate, it will have a negative impact on the recognition results later.

Xiaomi’s table detection algorithm will detect both the table area and the four corner points of the table, and through perspective transformation and our self-developed anti-distortion algorithm to get a flat table with only the table area, the effect is shown in the figure.

Since the algorithm runs on the cell phone side, it needs to ensure the running speed and model size, Xiaomi adopts a very lightweight one-stage detection framework, backbone using shuffleNetV2.

regressing the key point information while detecting the table box to facilitate perspective correction of the table, and using Wing loss instead of L1 loss to make the key point regression more accurate.

In terms of data, the algorithm is used to mine a large amount of form detection data from public data at low cost, which significantly improves the form detection effect. The final model size is about 1M and runs smoothly on Xiaomi phones.

Table Recognition Algorithm

The table recognition algorithm runs on the server side and contains the following main modules: text detection, text recognition, table structure prediction, cell matching, alignment algorithm, and Excel export.

The current mainstream approach is to represent tables in HTML hypertext, and then encode the HTML to predict the HTML sequence and the corresponding coordinate information.

This method has achieved good results on open source datasets, and Ping An Technology of China and Baidu have also adopted this scheme, but too many tags in HTML lead to error-prone table structure recognition.

To address the shortcomings of this method, we adopt a new encoding method for tables, which can represent tables of arbitrary structure with only four tags, greatly improving the accuracy of table structure recognition.

Table recognition is accelerated by using the Faster transformer inference framework during deployment, which officially claims that Xiaomi’s inference speed is improved by about 20 times, significantly improving user experience.

Summary

The algorithm can efficiently and easily extract tables from images, greatly improving office efficiency. Xiaomi said that engineers will continue to improve the recognition experience of document-based images in Xiaomi phones.

Latest

Apple Watch Pro body size comparison: 49mm body with more than 2-inch screen

Sources say the upcoming Apple Watch Pro will be...

Solar car maker Lightyear has raised $85 million and is ready for production

Lightyear, a solar car startup from the Netherlands developing...

Xbox Elite 2 Controller will support customization in the Xbox Design Lab

Microsoft will bring the Xbox Elite 2 grip to...

Samsung M8 smart monitor is now receiving a massive price discount

The Samsung M8 smart monitor is currently on sale at a massive discount currenlty ongoing at the company's JD online store.

Newsletter

spot_img

Don't miss

Apple Watch Pro body size comparison: 49mm body with more than 2-inch screen

Sources say the upcoming Apple Watch Pro will be...

Solar car maker Lightyear has raised $85 million and is ready for production

Lightyear, a solar car startup from the Netherlands developing...

Xbox Elite 2 Controller will support customization in the Xbox Design Lab

Microsoft will bring the Xbox Elite 2 grip to...

Samsung M8 smart monitor is now receiving a massive price discount

The Samsung M8 smart monitor is currently on sale at a massive discount currenlty ongoing at the company's JD online store.

Intel Raptor Lake iGPU run scores revealed: graphics performance is remarkable

Although Intel needs to wait until the 14th generation...
Threza Gabriel
Threza Gabrielhttps://www.techgoing.com
TechGoing is a global tech media to brings you the latest technology stories, including smartphones, electric vehicles, smart home devices, gaming, wearable gadgets, and all tech trending.
spot_imgspot_img

AMD Zen4 manual bucking after a magical scene occurred: 5GHz frequency temperature plummeted 37 ℃

Yesterday, a tipster claimed that the QS/ES test piece of AMD Zen4 had heat buildup, and that the 170W TDP version easily touched 230W...

Honor X40 Series will launch in China on September 15

The Honor X40 Series will get new entries on September 15, at this time we are not certain about the specifications of this coming series.

Shadowstone Insta360 official announcement: a new generation of panoramic sports cameras

On September 5, the world-renowned smart video brand Shadowstone Insta360 released a teaser trailer, announcing that the new product will be officially released at...