Our API is powered up by the Datumbox Machine Learning Framework. It currently offers 14 different functions as part of our Machine Learning platform. All of the following functions use several sophisticated classification techniques and they are accessible via our REST API. To call any of our functions, just sign up for an API key and follow the guidelines as described on the Technical Details. Please note that the API provides a limited number of calls (1000 per day). If you require more calls consider using the Datumbox Machine Learning Framework instead.
The Sentiment Analysis function classifies documents as positive, negative or neutral (lack of sentiment) depending on whether they express a positive, negative or neutral opinion.
The Twitter Sentiment Analysis function allows you to perform Sentiment Analysis on Twitter. It classifies the tweets as positive, negative or neutral depending on their context.
The Subjectivity Analysis function categorizes documents as subjective or objective based on their writing style. Texts that express personal opinions are labeled as subjective and the others as objective.
The Topic Classification function assigns documents in 12 thematic categories based on their keywords, idioms and jargon. It can be used to identify the topic of the texts.
The Spam Detection function labels documents as spam or nospam by taking into account their context. It can be used to filter out spam emails and comments.
The Adult Content Detection function classifies the documents as adult or no-adult based on their context. It can be used to detect whether a document contains content unsuitable for minors.
The Readability Assessment function determines the degree of readability of a document based on its terms and idioms. The texts are classified as basic, intermediate and advanced depending their difficulty.
The Language Detection function identifies the natural language of the given document based on its words and context. This classifier is able to detect 96 different languages.
The Commercial Detection function labels the documents as commercial or non-commercial based on their keywords and expressions. It can be used to detect whether a website is commercial or not.
The Educational Detection function classifies the documents as educational or non-educational based on their context. It can be used to detect whether a website is educational or not.
The Gender Detection function identifies if a particular document is written-by or targets-to a man or a woman based on the context, the words and the idioms found in the text.
The Keyword Extraction function enables you to extract from an arbitrary document all the keywords and word-combinations along with their occurrences in the text.
The Text Extraction function enables you to extract the important information from a given webpage. Extracting the clear text of the documents is an important step before any other analysis.
The Document Similarity function estimates the degree of similarity between two documents. It can be used to detect duplicate webpages or detect plagiarism.
The Datumbox API is a web service which allows you to use our tools from your website, software or mobile application. The API gives you access to all of the supported functions of our service. In this page you will find all the information that you need in order to use our API, fully implemented code samples and the latest API Documentation.
Our Web Service uses "REST-Like" RPC-style operations over HTTP POST requests with parameters URL encoded into the request and its response is encoded in JSON. It is designed to be easy to use and you can implement it in any model computer language that allows you generating web requests.
The current version of the API is 1.0v. In order to use the API you must Register for a Datumbox account and get your API Key from your profile. Once you have your key you immediately start generating API requests on our service. Below you can find a list of code samples and the API Documentation which can help you use our API within minutes: