About Backbone

Backbone allows you to extract structured SQL-like databases from unstructured text and tables: Adobe PDF, Microsoft Office documents, HTML pages, etc.

Backbone allows you to gather a large amount of disparate signals (natural language text, tables and even figures) scattered all over your documents and databases into a single massive fully-searchable repository.

Backbone aims to provide a flexible yet secure way to organize, consume and share your data.

How it works

Backbone makes it easy to explore your data. It is designed for building practical, enterprise-grade exploratory data analysis processes in a matter of hours - not months.

1

Ingest

Extract data and metadata from a variety of sources and file formats: HTML, JSON, CSV, Adobe PDF, Microsoft Office, SQL databases and much more.

2

Index

Let the heavy lifting begin: Backbone automatically mines, structures, indexes and replicates your data in real-time.

3

Extract

Build query focused datasets and export them as CSV or GraphML. Build yourself custom dashboards using Backbone's Web API or our Java SDK.

4

Explore

Explore your datasets using our Web interface, Backbone-UI, or load your data into third-party applications such as Cytoscape.

Key features

  • Visibility Labels

    Databases generally grant user access permission at the table level. We leverage Accumulo’s cell-level security in order to ensure data are only available to the right people.

    This cell-level access control provides flexibility that prohibits access to data in accordance with policies, while maximizing access to other data.

  • Extractors

    An extractor is a user-defined expression whose goal is to target a subset of a data stream. Extractors help you specify which data are relevant and which are not.

    For example, it is possible to only extract and index email addresses found in a data stream.

  • Fingerprints

    A fingerprint is a user-defined label applied to all documents sharing a common set of properties. Fingerprints help you organize documents and make searching easier.

    For example, it is possible to automatically apply a "crypto.pgp" label to all PGP encrypted content found in a data stream.

Pricing & Features comparison

  • Usage
  • Number of Teams
  • Number of Users / team
  • Number of Repositories
  • Data Sources
  • Small Number of Files (manual upload)
  • Huge Number of Files (batch upload via SFTP)
  • SQL Database
  • Continuous Data Ingest (with Apache NiFi)
  • File Formats
  • Max. File Size
  • CSV
  • JSON
  • Microsoft Office (doc, xls, ppt)
  • Adobe PDF
  • Other File Formats (html, xml, txt, rtf, odf, ...)
  • Features
  • Visibility Labels
  • Extractors
  • Fingerprints
  • REST API
  • SDK for Java
  • Analytics
  • Search Engine
  • CSV Export
  • JSON Export
  • GraphML (compatible with Cytoscape)
  • Security
  • At the Application Level
  • At the Database Level
  • Support
  • Upgrades
  • Email Support

Express

Hosted

  • Number of Teams

    1

  • Number of Users / team

    3

  • Number of Repositories

    Unlimited

  • Small Number of Files (manual upload)
  • Huge Number of Files (batch upload via SFTP)
  • SQL Database
  • Continuous Data Ingest (with Apache NiFi)
  • Max. File Size

    100MB

  • CSV
  • JSON
  • Microsoft Office (doc, xls, ppt)
  • Adobe PDF
  • Other File Formats (html, xml, txt, rtf, epub, odf, ...)
  • Visibility Labels
  • Extractors
  • Fingerprints
  • REST API
  • SDK for Java
  • Search Engine
  • CSV Export
  • JSON Export
  • GraphML (compatible with Cytoscape)
  • At the Application Level
  • At the Database Level
  • Upgrades

    All

  • Email Support

    5 Business Days

Professional

Hosted

  • Number of Teams

    Unlimited

  • Number of Users / team

    Unlimited

  • Number of Repositories

    Unlimited

  • Small Number of Files (manual upload)
  • Huge Number of Files (batch upload via SFTP)
  • SQL Database
  • Continuous Data Ingest (with Apache NiFi)
  • Max. File Size

    50Go

  • CSV
  • JSON
  • Microsoft Office (doc, xls, ppt)
  • Adobe PDF
  • Other File Formats (html, xml, txt, rtf, epub, odf, ...)
  • Visibility Labels
  • Extractors
  • Fingerprints
  • REST API
  • SDK for Java
  • Search Engine
  • CSV Export
  • JSON Export
  • GraphML (compatible with Cytoscape)
  • At the Application Level
  • At the Database Level
  • Upgrades

    All

  • Email Support

    Next Business Day

Appliance

Your own servers

  • Number of Teams

    Unlimited

  • Number of Users / team

    Unlimited

  • Number of Repositories

    Unlimited

  • Small Number of Files (manual upload)
  • Huge Number of Files (batch upload via SFTP)
  • SQL Database
  • Continuous Data Ingest (with Apache NiFi)
  • Max. File Size

    50Go

  • CSV
  • JSON
  • Microsoft Office (doc, xls, ppt)
  • Adobe PDF
  • Other File Formats (html, xml, txt, rtf, epub, odf, ...)
  • Visibility Labels
  • Extractors
  • Fingerprints
  • REST API
  • SDK for Java
  • Search Engine
  • CSV Export
  • JSON Export
  • GraphML (compatible with Cytoscape)
  • At the Application Level
  • At the Database Level
  • Upgrades

    1 Year

  • Email Support

    Next Business Day

Keep me updated

If you want to be kept in the loop, subscribe below.