Nationwide Data Platform for a Major Bank Strengthens Foundational Capabilities
A state-owned bank (Bank C) initiated a bank-wide IT strategy to support its business development, strengthen foundational management, optimize engineering implementation, and improve project delivery—aiming to become a top-tier large retail commercial bank. To achieve this, it needed to accelerate the construction of its big data infrastructure, enhance data governance, and boost the development and delivery capabilities of its data products—fully advancing into the Big Data 3.0 era.
The big data platform warehouse focuses on future application needs on the one hand, and on the other hand makes refined use of existing technologies to continuously refine and improve the quality of data, provide management with powerful support for operational decision-making, and realize the goal of easy-to-use data.
Through the construction of the system, a set of perfect and scientific data models that can cope with multiple business areas are built, and an enterprise-level data warehouse for bank-wide applications is established. At the same time, based on the actual application of data in various business departments, a unified data portal is constructed, which will support the bank's demands for operation and management, risk management and control, marketing, and data analysis in the coming years.
The project adopted ORIGIEN's integrated development module and functional components from the Data Asset Management Platform. It included an embedded enterprise-level data model and a comprehensive indicator system, helping the bank rapidly build its data warehouse and establish a business-aligned indicator framework. The platform’s Data Service Module provided a unified data access portal with a multi-level access structure to meet diverse data service needs.
In the big data platform warehouse system, both the integration layer and the foundation layer are built based on the ten major thematic models of the financial sector. These are further customized and developed in accordance with the specific characteristics of banking data, forming a dual-framework paradigm architecture of “logical model + physical model.”
For the logical data model, the main entity models are decomposed according to indicators such as data volume, usage frequency, and data update frequency. A dual-entity approach is adopted in logical model design to ensure both the standardization of the logical model and the timeliness required for large-scale banking data.
For the physical model, based on metrics such as data volume and batch processing duration, physical partitioning and model preprocessing are carried out. This enhances both the efficiency and timeliness of data processing.
In the common processing layer of the big data platform warehouse system, the bank’s fundamental indicators are uniformly processed and calculated across five dimensions: agreement, customer, channel, organization, and product, thereby providing standardized data services.
For data indicators with the same business meaning, the common processing layer ensures a unified and unique standard is applied during implementation. The processing specifications and indicator names are incorporated into the bank’s data standards framework and released across the organization in a consistent manner.
In the data mapping process of the big data platform warehouse system, data code values are aligned with the bank-wide published data standards, field names are standardized, and identical code values from different source systems are consolidated. This reduces data redundancy and improves both data usage efficiency and user experience.
Fields with poor data quality, high null rates, or ambiguous meanings are filtered out and flagged for feedback, thereby enhancing the quality of data available for downstream applications.
By adopting open-source frameworks and developing innovative visualization components, a data visualization development framework and standards tailored for the banking sector have been established. Within this framework, map elements from the original development environment have been enhanced to include visualized displays of customer migration trajectories. In addition, multiple models such as graph theory, time series, random forest, and text mining are leveraged to develop diverse data products for model R&D.
Develop data products using a variety of models such as graph theory, time series, random forest, and text mining.
High Quality & Efficiency:
Since launch, the big data platform has supported numerous core systems and analytics projects across Bank C. With over 100 cluster nodes and more than 1PB of storage, it processes over 10,000 tasks per day with 99.97% annual system availability. It supports over 50 downstream applications, including regulatory reporting, data middle platform, eight data marts, portals, and management dashboards.
Significant Cost Reduction & Efficiency Gains:
The warehouse's shared processing layer organizes data from a business perspective, reducing duplicate calculations and storage overhead, and significantly lowering indicator processing costs.
User-friendly interaction of service applications:
realize the data product display mode of visual interactive experience, support the rapid implementation and unified management of data products, providing reference and support for users to realize their business intentions through more intuitive display of results, and realizing goal-oriented visual display of the actual needs of data products, and other functions.
Since its launch, the Big Data Portal Application System for Enabling Business Operation Analysis has served a number of first-tier branches within the bank as well as multiple business departments at the head office. The data products built by the system have provided strong support for the bank's business development, operation management, precision marketing, risk prevention and control.
-
ORIGIEN Data Asset Platform
The ORIGIEN Data Asset Platform integrates data governance, data middle platform, and data tools into a single data intelligence foundation to enhance data governance and unlock the value of data assets.
Experience the Product -
Big Data Platform/Data Lake
Enables integrated collection, management and consumption of middle platform source data, automating raw data ingestion and governance
Experience the Product -
ORIGIEN Data Service Platform
The ORIGIEN Data Service Platform provides a unified data service exit, realizing unified market management of data, effectively lowering the barriers to data opening, and ensuring the security of data opening.
Experience the Product -
ORIGIEN Data Development Platform
The ORIGIEN Data Development Platform is a one-stop visual big data development and processing platform, integrating development, debugging, deployment, and operations.
Experience the Product -
Indicator Management Platform
Driven by the principle of "business-led, business-serving," the platform builds a standardized indicator data system designed for seamless collaboration and reuse across business and technology teams.
Experience the Product -
Real-time Data Warehouse Development Platform
Enables rapid implementation of unified stream-batch processing capabilities to deliver highly reliable, low-latency data services
Experience the Product -
External Data Management Platform
Delivers secure, efficient and unified external data services to power risk control, customer acquisition and marketing scenarios, enhancing business synergy while accelerating digital transformation.
Experience the Product -
Data Middle Platform
Enables high-level data integration and capability reuse through four key dimensions: platform consolidation, data governance, service development, and scenario activation to achieve data-as-a-service.
Experience the Product