All posts by Jon Peck

Traveling Salesman by API

Traveling Salesman is one of the classic NP-hard problems: finding the optimal solution can take a long time, but there are some great shortcuts that come close! Algorithmia now brings you a fast way to find a near-optimal route through multiple cities, thanks to the power of Genetic Algorithms and easily accessible APIs. Read More…
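
To give a feel for how a genetic algorithm approaches this problem, here is a minimal local sketch in Python. It is not the algorithm hosted on Algorithmia; the function names (`tour_length`, `ordered_crossover`, `solve_tsp`), the parameters, and the choice of ordered crossover with swap mutation are all assumptions made for illustration, and cities are assumed to be plain (x, y) points with straight-line distances.

```python
import math
import random


def tour_length(tour, cities):
    """Total length of the closed tour that visits cities in the given order."""
    return sum(
        math.dist(cities[tour[i]], cities[tour[(i + 1) % len(tour)]])
        for i in range(len(tour))
    )


def ordered_crossover(a, b, rng):
    """Keep a slice of parent a in place; fill the rest in parent b's order."""
    i, j = sorted(rng.sample(range(len(a)), 2))
    middle = a[i:j + 1]
    rest = [c for c in b if c not in middle]
    return rest[:i] + middle + rest[i:]


def solve_tsp(cities, pop_size=100, generations=300, mutation_rate=0.2, seed=0):
    """Evolve a population of tours and return the shortest one found."""
    rng = random.Random(seed)
    n = len(cities)
    # Start from random permutations of the city indices.
    population = [rng.sample(range(n), n) for _ in range(pop_size)]
    for _ in range(generations):
        # Selection: keep the shorter half of the population unchanged.
        population.sort(key=lambda t: tour_length(t, cities))
        survivors = population[: pop_size // 2]
        children = []
        while len(survivors) + len(children) < pop_size:
            a, b = rng.sample(survivors, 2)
            child = ordered_crossover(a, b, rng)
            # Mutation: occasionally swap two cities in the child.
            if rng.random() < mutation_rate:
                i, j = rng.sample(range(n), 2)
                child[i], child[j] = child[j], child[i]
            children.append(child)
        population = survivors + children
    return min(population, key=lambda t: tour_length(t, cities))


if __name__ == "__main__":
    rng = random.Random(42)
    cities = [(rng.random(), rng.random()) for _ in range(15)]
    best = solve_tsp(cities)
    print(best, round(tour_length(best, cities), 3))
```

The result is a good tour rather than a guaranteed optimum, which is the trade-off the shortcut makes: a fixed number of generations instead of an exhaustive search over every ordering.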

Rapidly Extract Information from Public Websites

We have a lot of fun, heavy-hitting algorithms in our marketplace: deep-learning tools like Image Tagger and pipelining mechanisms such as Video Metadata Extraction are designed to bring the power of Machine Learning to your app via easy-to-use APIs.

But sometimes, all you need to do is extract some simple information from publicly available sources: for example, finding all the email addresses of a company’s C-Suite, or summarizing the topic pages of a FAQ. You could accomplish some of it with a Python script and some regex magic, but that wouldn’t bring the benefits of a remote API: datacenter-grade network connections, multiple IPs, and distributed parallel processing. And it wouldn’t give you access to more complex algos such as automatic tagging or sentiment analysis. With Algorithmia, you get all the benefits of the cloud without having to build and host your own workers, plus the combined expertise of our growing network of algorithm developers. Read More…
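
For comparison, the do-it-yourself route mentioned above might look something like the following sketch. It is an illustration only: the pattern is a deliberately simple approximation of an email address (not a full RFC 5322 matcher), `extract_emails` is a hypothetical helper name, and the sample text stands in for a page you would otherwise have to fetch yourself.

```python
import re

# Simplified pattern (an assumption for this sketch): local part, "@",
# then a domain with at least one dot. It will miss some valid addresses.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9-]+(?:\.[A-Za-z0-9-]+)+")


def extract_emails(text):
    """Return unique email-like strings, lowercased, in order of first appearance."""
    found = []
    for match in EMAIL_RE.findall(text):
        address = match.lower()
        if address not in found:
            found.append(address)
    return found


if __name__ == "__main__":
    sample = "Contact ceo@example.com or CFO@Example.com; also cto@example.org."
    print(extract_emails(sample))
```

A script like this handles one page on one machine; fetching many pages in parallel, and rotating the IPs they are fetched from, is the part a hosted API takes off your hands.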

Acquiring Data for Document Classification

The Document Classifier is a powerful tool for generating keyword predictions for your documents, whether they are chat transcripts, emails, historical documents, scientific abstracts, or any number of other possible sources.

However, your predictions will only be as good as the dataset you’ve trained on. Since the Document Classifier algorithm supports retraining, this can be done in chunks right on the Algorithmia platform: grab a bunch of data, train your classifier on it, then come back immediately after or weeks later to add even more training data. Read More…

Building a Timeline of your Video: Automatically Identify Objects, Sequence Times, and Integrate with Timeline.js

When we implemented InceptionNet, a microservice to detect and label objects (features) in photos, we knew it would be helpful. Then, we built out VideoMetadataExtraction, a video pipeline which allows you to run feature-detection algorithms (and others) on an entire video. This allowed for some really powerful activities — like automatically scanning through home security footage to find all the cars of a specific make & model, or stripping out all the nudity-containing scenes of a movie to make a G-rated version.

Today, we’ll go further by showing you how to visualize all the features in your video, thanks to the VideoTagSequencer and Timeline.js, a beautiful JavaScript library for displaying timelines on the web.

If you want to skip directly to the demo, please do, but come on back for a full breakdown of the integration pipeline and code samples! Read More…