Scalable Log Management for an International Financial Institution


Challenge

An international financial institution needed a robust log management solution to monitor hosts across its dev, UAT, and production environments. The key challenge was to collect metrics and logs in real time while maintaining high availability, security, and scalability. The system also had to provide georedundancy across two locations (Warsaw and Frankfurt) and support seamless user access management.


Solution

To address this challenge, we deployed three independent clusters of the Sorigo Log Manager platform, serving the dev, UAT, and production environments. The solution uses Metricbeat for metric collection and Filebeat for log collection.
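As an illustration of how per-environment log shipping might be configured, the following minimal Python sketch builds a Filebeat-style configuration for each environment. The endpoint URLs, the log path, and the `filebeat_config` helper are hypothetical, and the sketch assumes the platform accepts the standard Beats Elasticsearch output, which this case study does not confirm.

```python
import json

# Hypothetical ingest endpoints, one per environment (not taken from the project).
CLUSTER_HOSTS = {
    "dev": ["https://logs-dev.example.internal:9200"],
    "uat": ["https://logs-uat.example.internal:9200"],
    "prod": ["https://logs-prod.example.internal:9200"],
}


def filebeat_config(environment: str, log_paths: list[str]) -> dict:
    """Build a Filebeat-style configuration for one environment."""
    if environment not in CLUSTER_HOSTS:
        raise ValueError(f"unknown environment: {environment}")
    return {
        "filebeat.inputs": [
            {
                "type": "filestream",
                "id": f"{environment}-app-logs",
                "paths": log_paths,
                # Tag every event so the environment stays visible downstream.
                "fields": {"environment": environment},
            }
        ],
        # Assumption: events are shipped with the Beats Elasticsearch output.
        "output.elasticsearch": {"hosts": CLUSTER_HOSTS[environment]},
    }


if __name__ == "__main__":
    print(json.dumps(filebeat_config("uat", ["/var/log/app/*.log"]), indent=2))
```

Keeping one endpoint list per environment mirrors the separation between the dev, UAT, and production clusters, so a host in one environment never ships to another environment's cluster.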

Key implementation steps included:

  • Infrastructure Deployment: 12 servers supporting two production clusters with georedundancy.
  • Centralized Monitoring: Logs and performance metrics are collected and processed on the platform, ensuring real-time visibility.
  • SSO Integration: Single Sign-On (SSO) authentication via the Azure portal for improved access management.
  • Ongoing Maintenance: We actively manage permissions, add new data sources, monitor cluster resources, and optimize configurations.
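The case study does not describe how traffic moves between the two production sites. As a rough sketch of the idea behind georedundancy, the Python example below picks the first site, in preference order, that still has enough healthy nodes. The `Site` type, the node counts, and the 50% threshold are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Site:
    name: str
    healthy_nodes: int
    total_nodes: int


def pick_active_site(sites: list[Site], min_healthy_ratio: float = 0.5) -> Optional[str]:
    """Return the first site, in preference order, with enough healthy nodes."""
    for site in sites:
        if site.total_nodes == 0:
            continue  # An empty site can never serve traffic.
        if site.healthy_nodes / site.total_nodes >= min_healthy_ratio:
            return site.name
    return None  # No site meets the threshold.


if __name__ == "__main__":
    # Hypothetical snapshot: the preferred site is degraded, the second is healthy.
    sites = [Site("Warsaw", 1, 6), Site("Frankfurt", 6, 6)]
    print(pick_active_site(sites))
```

With the snapshot above, the degraded first site is skipped and the second site is selected, which is the behavior a georedundant setup is meant to provide.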

As part of ongoing maintenance, we also handle:

  • Cluster configuration optimization
  • User and permissions management
  • Error debugging
  • Schema Registry management
  • Configuration and management of data streams

Result

The implementation provided the financial institution with a highly scalable and secure log management solution. Key benefits included:

  • Improved observability: Real-time monitoring of infrastructure across multiple locations.
  • Enhanced access control: Streamlined authentication through Azure SSO.
  • Operational efficiency: Optimized cluster performance and seamless data flow management.
  • Georedundancy assurance: Increased resilience by distributing clusters across two geographic locations (Warsaw and Frankfurt).

With this solution, the institution achieved reliable, centralized monitoring while ensuring compliance and operational excellence.
