ITBI Solution

Data Lake.

The ITBI Data Lake is a powerful feature designed to enhance your analytical capabilities by providing a flexible and accessible repository for your SMF data. The Data Lake streamlines the process of accessing and utilizing your data, making it easier to conduct in-depth, meaningful analyses.

Access and Utilization

It allows you to write simple scripts to extract and analyze SMF interval data, enabling deep-dive analysis tailored to specific needs. All of your data is decoded and stored in the Data Lake, ensuring that it is readily available for analysis at any time. The Data Lake can be accessed using a variety of tools, including SQL, R, SAS, and Python, offering users the flexibility to work with their preferred platforms.

 

Contents of the Data Lake

The Data Lake contains three types of data arrangements:

  1. Raw SMF Files: These are the original SMF files provided by the customer, stored exactly as received.
  2. Raw SMF in Tables: This includes nearly all fields from all supported SMF types, mostly decoded, and organized following the standard IBM SMF naming conventions.
  3. View Tables: The most useful variables have been pre-selected and organized into view tables and organized following the standard MXG naming conventions. This setup reduces the time needed to find key variables for generating reports, ultimately saving you time and making your data analysis process more efficient and user-friendly.

Applications of the Data Lake

The Data Lake serves various functions, from reporting on recently received data within minutes of its arrival at SMT Data to analyzing fields and details not included in the ITBI Data Cubes, which supply the standard reports with data. It supports event-based data reporting and the implementation of complex logic or calculations that are not feasible within the standard BI reports. In addition, it facilitates the creation of graphics or formatting not supported by the BI reports.

Specific use cases include detailed reporting on fields and events obscured in aggregated data and leveraging existing SAS/MXG expertise. These applications, particularly effective for those with a solid understanding of both SMF and MXG structures, maximize the utility of the Data Lake.

 

  • Direct and Immediate Data Access

    Access a diverse range of data directly through the ITBI portal using Amazon Athena, enabling quick ad hoc queries and prototyping for faster insights and decision-making without delays.

  • Comprehensive Data Inclusion

    From structured MXG-like tables to raw SMF files, the Data Lake covers a comprehensive range of data types, ensuring user access to both high-level summaries and detailed information for various analytical needs.

  • Versatile Integration and Programming Support

    Support for remote ODBC connections allows seamless integration with programming languages like SAS/MXG, enhancing the reuse of existing programs and expertise while facilitating integration with other systems for expanded utility.