Meet the Algorithmia Team at the Strata Data Conference in NYC, September 26-28, 2017

Location: Javits Center in New York City
Date: September 26-28, 2017

The theme of Strata is “turning data into advantage.” Here at Algorithmia, we help your team get the models from your data science efforts into production. Our team will be available to talk through the challenges you’re facing in productionizing your models or in finding the right models from our marketplace to meet your project’s needs. Whatever your questions are, we’re here to help.

Quick Facts:

  • Meet us at the Algorithmia Booth #P33
  • We’ll be giving away Algorithmia swag, so stop by and say hi.
  • Get 20% off your conference pass by entering the Algorithmia20 discount code at registration.

Introduction to Character Recognition

This is easy to understand, right?

[Image: easy ocr]

How about this? A bit harder?

[Image: moderate natural]

Are you able to decipher this one at all?

[Image: hard natural]
courtesy of Faris Algosaibi

The first example can be easily recognized by most character recognition algorithms. However, as your text gets progressively more complex, this seemingly simple task becomes more and more difficult for even the best machine learning algorithms to successfully complete. Read More…

Using H2O to Classify Domains in Production

[Image: Classify Websites]

Modern cyber attacks, such as botnets and ransomware, increasingly depend on (seemingly) randomly generated domain names. The malware uses those domains to establish Command & Control with its operators, a technique called Domain Fluxing. The recent WannaCry ransomware was famously stopped simply by registering one of those domain names.

The ability to quickly classify a domain name as *safe* or *malicious* is critical in the cybersecurity world. It can help alert security experts to suspicious activity or even block that activity. Such a system has two requirements:

  • Needs to be accurate: you don’t want to block your users from accessing safe websites
  • Needs to be scalable: it must handle thousands of transactions per second
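
To make the classification task concrete, here is a minimal sketch of one simple heuristic: flagging long domain labels whose characters look close to uniformly random. This is not H2O's implementation and not a production classifier. The function names, the features, and the thresholds (`entropy_threshold=3.5`, `min_length=12`) are assumptions chosen only for illustration, and the sample domains are hypothetical.

```python
import math
from collections import Counter

VOWELS = set("aeiou")


def shannon_entropy(text: str) -> float:
    """Shannon entropy of the character distribution, in bits per character."""
    if not text:
        return 0.0
    counts = Counter(text)
    total = len(text)
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


def domain_features(domain: str) -> dict:
    """Compute a few cheap lexical features from the leftmost label of a domain."""
    # Use only the leftmost label, e.g. "google" from "google.com".
    label = domain.lower().split(".")[0]
    letters = [c for c in label if c.isalpha()]
    vowel_ratio = sum(c in VOWELS for c in letters) / len(letters) if letters else 0.0
    return {
        "length": len(label),
        "entropy": shannon_entropy(label),
        "vowel_ratio": vowel_ratio,
    }


def looks_generated(domain: str, entropy_threshold: float = 3.5, min_length: int = 12) -> bool:
    """Illustrative rule: long labels with high character entropy look random.

    Both thresholds are assumptions for this sketch, not tuned values.
    """
    features = domain_features(domain)
    return features["length"] >= min_length and features["entropy"] >= entropy_threshold


if __name__ == "__main__":
    # Hypothetical sample domains, for illustration only.
    for name in ["google.com", "xjw9qkz3hvt7plm2.example"]:
        print(name, domain_features(name), looks_generated(name))
```

A fixed rule like this is fast enough to scale, but it will misclassify short generated names and long legitimate ones, which is why the accuracy requirement usually calls for a trained model over features of this kind rather than hand-picked thresholds.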

There are plenty of approaches to this problem, especially in the academic world (S. Yadav – 2010, J. Munro – 2013). The fine folks at H2O also have an excellent code sample we found here. This blog post will briefly describe how H2O’s implementation works and how you can deploy and scale it on Algorithmia. Read More…