On this page
Unlike the default installation, where all the Fyrefuse dependencies are provided together in the same Docker Compose with the simplest configuration possible, the custom installation assumes that you already have an Apache Spark and optionally Trino clusters set up along with an object storage and a metastore.
On the machine or VM where Fyrefuse is deployed, you need the following:
Additionally, you need the following services already configured and running:
If you haven’t set up these components yet, follow their official documentation:
Fyrefuse provides a Docker Compose configuration along with an env file for the initial configuration.
More specifically, it contains:
If you are using the provided database image, no additional configuration is required.
If you're using an existing PostgreSQL instance, you must manually create two empty databases one for the Fyrefuse backend and one for the Hive Metastore along with a dedicated user with all privileges.
Create a new project and ensure the main
branch exists; this is where Fyrefuse will
store its files.
Generate a project access token with the following scopes:
Fyrefuse has a set of environment variables for configuring the access parameters and general settings of the external Fyrefuse services.
Variable Name | Description | Default |
---|---|---|
BE_VERSION | The Backend version to be deployed | latest |
FE_VERSION | The Frontend version to be deployed | latest |
DATABASE_USER | Specify the Fyrefuse database username | fyrefuse_user |
DATABASE_PASSWORD | Specify the Fyrefuse database password | |
DATABASE_HOST | Specify the Fyrefuse database hostname or ip | localhost |
DATABASE_PORT | Specify the Fyrefuse database port | 5432 |
DATABASE_NAME | Specify the Fyrefuse database name | fyrefuse_database |
DATABASE_SCHEMA | Specify the Fyrefuse database schema | fyrefuse_schema |
API_URL | The base api url to be used by Fyrefuse frontend | |
API_LOGGER | The endpoint to be used by Fyrefuse frontend to enable notifications | |
WS_URL | The endpoint to be used by Fyrefuse frontend to enable Websockets | |
PIPELINE_LOGGER_LOGGER_URL | The api url to be used by the FEM to send pipelines’ logs | |
PIPELINE_LOGGER_REPORT_URL | The api url to be used by the FEM to send pipelines’ reports | |
GITLAB_PRIVATE_ACCESS_TOKEN | Default access token for retrieving the FEM from gitlab (in case of spark standalone deployment engine) | |
FEM_BUILD_NAME | FEM version that will be used for running the Fyrefuse’s pipelines | |
FEM_BUILD_LINK | FEM build link | |
METASTORE_DB_NAME | Specify the Metastore database name | metastore |
METASTORE_DB_SCHEMA | Specify the Metastore database schema | public |
METASTORE_URI | Specify the Metastore connection uri | thrift://<metastore_hostname> |
METASTORE_WAREHOUSE_DIR | Specify the Metastore warehouse dir | s3a://datalake/warehouse/ |
METASTORE_DB_CONNECTION_STRING | Specify the Metastore database connection string | jdbc:postgresql://localhost:5432/metastore |
METASTORE_DB_DRIVER | Specify the Metastore database driver | org.postgresql.Driver |
METASTORE_DB_USERNAME | Specify the Metastore database username | fyrefuse_user |
METASTORE_DB_PASSWORD | Specify the Metastore database password | |
DATALAYER_HTTP_MODE | Specify the S3 object storage http mode | http |
DATALAYER_ENDPOINT | Specify the S3 object storage endpoint | |
DATALAYER_ACCESS_KEY | Specify the S3 object storage access key | |
DATALAYER_SECRET_KEY | Specify the S3 object storage secret key |
Download the Docker Compose file and navigate to its directory in the shell.
Configure the environment variables according to your setup.
Login to Fyrefuse registry with your credentials to get the access to the Fyrefuse’s images.
docker login -u <username> -p <personal_access_token> registry.gitlab.com
Access your configured S3 object storage and create a bucket named "datalake". Inside this bucket, create a folder named "warehouse".
Run the Docker Compose file:
docker compose up -d
Once the installation process is complete, you can start using Fyrefuse (available at localhost:8001) by creating your first project.
For instructions, see Create Your First Project.