Software Requirements

Operating System

GBDS can be installed on the following operating systems:

  • CentOS 7

  • Red Hat 7

  • Oracle Linux 7

  • Oracle Linux 8

Hadoop

GBDS is based on Apache Hadoop version 3.1, which is a collection of open-source software. Hadoop provides multi-purpose tools for parallel and scalable systems. Currently, GBDS is integrated with the following Hadoop components:

  • Ambari: Provisioning, management and monitoring of a Hadoop cluster.

  • Kafka: Distributed streaming system for real-time data integration.

  • Zookeeper: Coordination system that enables synchronization across a cluster.

  • HBase: Non-relational database management system.

  • HDFS: Distributed file system designed to run on commodity hardware.

Database

GBDS uses two different database systems, relational and non-relational:

  • HBase For biometric images and templates.

  • MySQL for metadata, such as transactions, exceptions, criminal cases, biometric profiles and unresolved latents.

circle-check

Local Load Balancing

The model for extracting templates from an image requires more resources than the biometric comparison between templates and is performed in the GBDS API handler. To optimize hardware usage, GBDS is highly parallelized and each node in a cluster must be able to receive API requests, if configured to do so, therefore, the use of a load balancer is recommended to distribute requests equally among the nodes, aiming to achieve the best performance. This way, no node will be overloaded in the cluster.

It is possible to use load balancers in both hardware and software. A simple software solution for load balancing is HAProxy, an open-source and free software that provides load balancing and server proxy tools.

Last updated

Was this helpful?