Requirements for installing

DataHub is an Enterprise Content and Data Integration platform that manages file/data transfer and synchronization operations at scale across many different storage management platforms. It offers significant business flexibility across all aspects of Enterprise Content and Data Integration, including:

  • Large-scale file migration
  • Enterprise file analysis
  • Classification, compliance, and actions/outcome management
  • Multi-system hybrid/sync

Solution Architecture

The DataHub platform is built on a pluggable, content-streaming architecture that enables highly automated file/data transfer and synchronization for up to billions of files. File bytes stream in from one connection endpoint (defined by the administrator), across a customer-owned and customer-operated network and/or cloud service, and then stream out to a second connection endpoint. Content can also flow bidirectionally between two endpoints rather than solely from an "origin" to a "destination."

DataHub follows a "security-neutral" model that fully uses your existing infrastructure and security schema. Content and data bytes exist in DataHub memory only in chunks, which are immediately streamed from one endpoint, through DataHub, and out to another endpoint. Both incoming and outgoing bytes are transferred using the most secure protocol available for each connector. For example, DataHub uses SSL or TLS encryption along with OAuth (token-based) authorization for most cloud service connectors.
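The chunked streaming described above can be illustrated with a minimal Python sketch. This is not DataHub's implementation; the function name stream_copy, the 64 KB chunk size, and the in-memory stand-ins for the two endpoints are all assumptions made for illustration. It only shows the general idea that a single chunk is held in memory at a time while bytes move from one endpoint to another.

```python
import io
from typing import BinaryIO

CHUNK_SIZE = 64 * 1024  # hypothetical chunk size; the documentation does not state one


def stream_copy(source: BinaryIO, destination: BinaryIO, chunk_size: int = CHUNK_SIZE) -> int:
    """Copy bytes from source to destination one chunk at a time.

    Only one chunk is held in memory at any moment. Returns the bytes copied.
    """
    total = 0
    while True:
        chunk = source.read(chunk_size)
        if not chunk:  # an empty bytes object signals end of stream
            break
        destination.write(chunk)
        total += len(chunk)
    return total


if __name__ == "__main__":
    origin = io.BytesIO(b"x" * 200_000)  # stand-in for the first endpoint
    target = io.BytesIO()                # stand-in for the second endpoint
    print(stream_copy(origin, target))
```

In a real deployment the two file-like objects would be connector streams rather than in-memory buffers, and the transport security described above would apply to each side.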

Platform Components

Server/Manager Admin Console

The server component of DataHub runs as a background service on at least one server. The DataHub service can also be deployed on multiple server nodes in a cluster.

Database Server

DataHub utilizes an embedded PostgreSQL database as part of its standard installation process. Additional options include Microsoft SQL Server or hosted PostgreSQL. DataHub requires network connectivity with the database to function properly in all cases.

Agents

DataHub agents are designed to serve distinct use cases, depending on where they are deployed:

  • User Desktop Agents - local user desktops
  • Remote Agent server(s) - remote office NFS, eliminating the need for a VPN solution

Connectors

DataHub's Platform Connectors provide storage endpoint integration. Each storage connector has been carefully developed to implement all of the available security features via its native API. The connectors themselves can be deployed to execute in the context of the primary DataHub Server or in the context of one or more DataHub Remote Sites depending on the deployment model selected.

DataHub stores connection information such as the authenticated security token, URL, and UNC path in the database. In some cases, such as network file shares, the user name and password are also stored, encrypted within the database. DataHub Connectors use the default ports of each platform's required communication protocol.

System Locale

Region Settings must be set to English (United States).

  1. Go to Control Panel > Clock and Region.
  2. Select Region > Administrative tab.
  3. Select Change system locale.
  4. Set Current system locale to "English (United States)".

DataHub Processing Servers (1 - # Servers)

  • CPU cores: 8
  • RAM: 32GB (minimum)
  • OS disk: 500GB (minimum)
  • OS: Windows Server 2016

If using a cloud server, the following templates are recommended: 

  • AWS: m4.2xlarge 

  • Azure: D8S_V3 
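A quick preflight check against the processing-server figures above could look like the following Python sketch. It is an illustration only, not a DataHub tool: the function name find_shortfalls is hypothetical, treating 8 CPU cores as a minimum is an assumption (the list marks only RAM and disk as minimums), and RAM is left out because the Python standard library has no portable way to read it.

```python
import os
import shutil

# Figures taken from the processing-server list above.
MIN_CPU_CORES = 8   # assumption: treated here as a minimum
MIN_DISK_GB = 500


def find_shortfalls(cpu_cores: int, disk_total_gb: float) -> list[str]:
    """Return a human-readable message for each figure that is not met."""
    problems = []
    if cpu_cores < MIN_CPU_CORES:
        problems.append(f"CPU cores: {cpu_cores} < {MIN_CPU_CORES}")
    if disk_total_gb < MIN_DISK_GB:
        problems.append(f"OS disk: {disk_total_gb:.0f}GB < {MIN_DISK_GB}GB")
    return problems


if __name__ == "__main__":
    cores = os.cpu_count() or 0  # logical cores; cpu_count() may return None
    root = os.path.abspath(os.sep)  # root of the current drive
    disk_gb = shutil.disk_usage(root).total / 10**9  # decimal gigabytes
    for line in find_shortfalls(cores, disk_gb) or ["All checked figures met"]:
        print(line)
```

Note that os.cpu_count() reports logical cores, which may be higher than the physical core count on hosts with hyper-threading.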

SQL Server (1 Server or Availability Group)

  • CPU cores: 16
  • RAM: 64GB (minimum)
  • OS_Vol: 100GB+ SSD – redundant / fault tolerant
  • OS: Windows Server 2016
  • Data_Vol1: 1TB premium SSD
  • Data_Vol2: 1TB premium SSD (optional)
  • Software/templates: SQL Server 2016 SP1 (or higher), Enterprise edition

If using a cloud server, the following templates are recommended: 

  • AWS: m4.4xlarge 

  • Azure: D16S_V3 

Supported Operating Systems

Server / Manager & Remote Sites

  • Windows Server 2019

  • Windows Server 2016

  • Windows Server 2012 R2

Supported Databases

Server / Manager and Remote Sites

  • Embedded PostgreSQL 10.10-2+
    • If you are using a PostgreSQL database, it must be configured to use English for messages.
  • SQL Server 2016+
    • Database Planning and Tuning Concepts should be implemented; this includes noting the database name, instance, and port number for SQL access if they differ from the defaults.

Browser Support

Supported: 

  • Chrome

Note:

DataHub Platform installation is not supported in Firefox, Edge, or Safari.

The DataHub Platform application generally works in Firefox, Edge, and Safari, although minor issues may be observed.

DataHub Platform is not supported in any version of Internet Explorer.

Open Port 9090

Either disable the Windows firewall on your VM or, as the more advanced option, keep the firewall enabled and open port 9090.
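To confirm from another machine that port 9090 is reachable, a small TCP connection test can help. The Python sketch below is an assumption-laden illustration, not part of DataHub: the function name port_is_open is hypothetical, "localhost" is a placeholder host, and the check only succeeds if a service is actually listening on the port at the time.

```python
import socket


def port_is_open(host: str, port: int = 9090, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:  # covers refused connections, timeouts, and DNS failures
        return False


if __name__ == "__main__":
    # "localhost" is a placeholder; substitute the DataHub server's host name.
    print(port_is_open("localhost"))
```

A False result does not by itself distinguish a blocked port from a stopped service, so check that the service is running before changing firewall rules.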

Administrator Password Requirements

Passwords must meet the following requirements: 

  • At least 8 characters
  • At least one uppercase letter (A-Z)
  • At least one digit (0-9)
  • At least one non-alphanumeric character (!@#$%)
  • Cannot contain the username
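The requirements above can be checked before submitting a password. The following Python sketch is illustrative only and is not DataHub's validation logic. The function name password_problems is hypothetical, and two details are assumptions: any non-alphanumeric character is accepted (the list above may only be giving examples with !@#$%), and the username comparison ignores case.

```python
import string


def password_problems(password: str, username: str) -> list[str]:
    """Return the unmet requirements; an empty list means the password passes."""
    problems = []
    if len(password) < 8:
        problems.append("fewer than 8 characters")
    if not any(c in string.ascii_uppercase for c in password):
        problems.append("no uppercase letter (A-Z)")
    if not any(c in string.digits for c in password):
        problems.append("no digit (0-9)")
    # Assumption: any character that is not a letter or digit qualifies.
    if all(c.isalnum() for c in password):
        problems.append("no non-alphanumeric character")
    # Assumption: the username check is case-insensitive.
    if username and username.lower() in password.lower():
        problems.append("contains the username")
    return problems


if __name__ == "__main__":
    for issue in password_problems("Admin#2024", "admin"):
        print(issue)
```

Returning every unmet requirement, rather than stopping at the first, lets an administrator fix a rejected password in one pass.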

Languages

DataHub Platform Application

  • English

The login screen will display in English (UK).

Other Recommendations

DataHub strongly recommends leveraging the following tools:
