Getting Started with Jaspersoft
Who is this article for?Data scientists analysing Coruson data.
Jaspersoft Admin permission is required
To support the analytic needs of Coruson customers Ideagen provides access to Tibco’s Jaspersoft BI solution. This allows customers to build views, reports and dashboards around their data and perform analysis.
This article outlines:
1. What is Jaspersoft?
Jaspersoft is a third-party business intelligence tool that is provided as part of a cloud service hosted on Amazon Web Services (AWS) within a secure virtual private cloud (VPC) network. Jaspersoft has no access to Coruson directly and instead uses a data warehouse.
The VPC ensures the Coruson infrastructure is isolated from other AWS customers. Monitoring, logging and threat detection systems are hosted on a separate VPC with secure access to the Coruson infrastructure for the specific purpose. This is controlled through network segregation (Control document ref: INFOSEC-00047 - Development Systems and Access Control).
The data warehouse cannot be accessed directly by customers. A secured VPN and AWS access key is required to connect to the warehouse. Connection is also restricted to Ideagen approved IP addresses.
For full details on the Coruson infrastructure please see document ref: ENL-00003 - Coruson Architecture Overview.
2. The process
Coruson data is stored in a mongoDB format. As such, the data must be translated into an easy to query format for use with Jaspersoft.
An ETL process will extract and transform the data, storing it in a Postgres database. This is the data warehouse.
The ETL process commences 5 minutes after the last one completed. The time taken for the ETL to complete varies from dataset to dataset.
Example:
- ETL process commences at 9:00am UTC.
- Process takes 1 hour to complete, finishing at 10:00am UTC.
- ETL process begins again at 10:05am UTC.
3. Data access
Each customer has their own Jaspersoft repository which has a unique link to their dataset within the warehouse.
Customers are provided with two users; a Jaspersoft administrator account that has administrative privileges within the repository it resides, and also an integration user which is used to access the reports and dashboards from within Coruson. The administrator account can then provision additional users within the repository as required.
Important Note: The administrator account provides full control over users, domains and reports within its repository only. This means they will be unable to view the warehouse as a whole or view any additional datasets.
Each repository is made up of:
- Users who can be given a variety of access levels within the repository.
- A domain, which contains links to the data in the PostgreSQL database.
- Ad-hoc views, pulling the data into a form that can be viewed and the data manipulated.
- Reports, which use the views to display and analyse the data.