An integrated search solution powered by ML

by Andrés Villarroel and Santiago Vazquez, Cloud Engineering

AWS Textract is an AWS service that allows developers to detect, extract text (even if it is handwritten), forms and tables from PDF, PNG, and JPG files.

The service is trained with deep learning, and it is incrementally adjusted by AWS video and image recognition services with millions of images and videos daily.

It is used for

Benefits

A2I (Augmented AI)
Textract sample

NLP (natural language processing) is a component of artificial intelligence, whose aim is to convert text into structured data so that it comprehends human speech while reproducing it by means of analysis, understanding, and generation of natural language.

Used for:

AWS Comprehend

It is an NLP service, trained and administered by AWS with million data points collected from diverse sources. Its learning can be enhanced with AutoML and customized data training.

Used for:

Benefits

Project Architecture

From the need to create an integrated search solution that allows extracting information, classify, comprehend and index it for later exploitation, we worked on the following: https://github.com/aws-samples/amazon-textract-comprehend-OCRimage-search-and-analyze

Issues

Final Project Architecture for the solution

FLOW

DEMO DEPLOYMENT

For the analysis, 2 options were considered:

Result search

2. Sentiment analysis of song lyrics and define whether it was positive, negative, neutral or mixed.

Analysis of results

USE CASES

Depending on settings, this search solution could be useful for:

Improvement can be achieved in the code (convert from CF to TF), integration with other ML services (Alexa, Lex, Translate, Transcribe). Besides, indexation has to be revised (e.g. frequency of revision of indexation)

AWS solutions like Comprehend, Textract, S3 and Lambda are very versatile and accessible. Their applications could be customized to different industries and organizations, built to retrain themselves and optimize operations and analytics.

Want to join an innovative cloud team? Contact us.

--

--

We are an AWS Premier Consulting Partner company. Since 2009 we’ve been delivering business outcomes and we want to share our experience with you. Enjoy!

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store
EDRANS Stories

We are an AWS Premier Consulting Partner company. Since 2009 we’ve been delivering business outcomes and we want to share our experience with you. Enjoy!