Algorithmia Blog - Deploying AI at scale

Introduction to Character Recognition

This is easy to understand, right?

easy ocr

How about this? A bit harder?

moderate natural

Are you able to decipher this one at all?

hard natural
courtesy of Faris Algosaibi

The first example can be easily recognized by most character recognition algorithms. However, as your text gets progressively more complex, this seemingly simple task becomes more and more difficult for even the best machine learning algorithms to successfully complete. Read More…

Using to Classify Domains in Production

Classify Websites

Modern cyber attacks, such as Botnets and Ransomware, are becoming increasingly dependent on (seemingly) randomly generated domain names. Those domains are used as a way to establish Command & Control with their owners, which is a technique called Domain Fluxing. The recent WannaCry ransomware was famously stopped simply by registering one of those domain names.

The ability to quickly classify a domain name as *safe* or *malicious* is a critical task in the cybersecurity world. It can help alert security experts of any suspicious activity or even block that activity. Such a system will have two requirements:

  • Needs to be accurate, you don’t want to block your users from accessing safe websites
  • Needs to be scalable, able to handle thousands of transactions per second

There are plenty of approaches to this problem, especially in the academic world (S. Yadav – 2010, J. Munro – 2013). The fine folks at also have an excellent code sample we found here. This blog post will briefly describe how H2O’s implementation works and how you can deploy and scale it on Algorithmia. Read More…

Traveling Salesman by API

Traveling Salesman is one of the classic NP-Hard problems: finding the optimal solution can take a long time, but there are some great shortcuts available which come close! Algorithmia now brings you a fast, near-optimal way to find the fastest route through multiple cities, thanks to the power of Genetic Algorithms and easily-accessible APIs. Read More…