All posts by James Sutton

Introduction to Character Recognition

This is easy to understand, right?

easy ocr

How about this? A bit harder?

moderate natural

Are you able to decipher this one at all?

hard natural
courtesy of Faris Algosaibi

The first example can be easily recognized by most character recognition algorithms. However, as your text gets progressively more complex, this seemingly simple task becomes more and more difficult for even the best machine learning algorithms to successfully complete. Read More…

Train a Machine to Turn Documents into Keywords, via Document Classification


Figuring out the meaning of a document was once a very hard problem for computers to solve… even for humans, understanding the complexity of natural language can be tricky!

Fortunately, there are some great tools that can help address those concerns. The Document Classifier turns your existing documents and associated keywords into a model which can be used to predict the most appropriate keywords for new blocks of text. Read More…

Introduction to Video Tag Sequencing

Video Metadata Extraction

Sifting through unlabelled videos can be difficult and time-consuming. Even for the most seasoned analyst, fatigue leads to mistakes. Whether you’re trying to detect anomalies in mission-critical infrastructure — or you just want to find all of the segments in your vacation videos that contain ducks — we have a microservice that can help reduce the workload.

What is the Video Tag Sequencer? How does it work?

The VideoTagSequencer is an algorithm takes the time series point data generated from VideoMetadataExtraction and converts it into an index of detected labels and sequences contained in the video. In a nutshell, it takes frame-by-frame results, and converts them into a list of time ranges at which each result occurs. Read More…