Wednesday 18 November 2020

Generating Reports from Db2 historical monitoring (db2histmon) data

 - David Sciaraffa, Software Engineering Manager – IBM Db2


(This blog is a follow-up to my previous blog which describes the Db2 Historical Monitoring (db2histmon) framework.)

The Db2 Historical Monitoring (db2histmon) scripts (available here) collect a broad range of Db2 and Operating System diagnostic data, and retain this data for a period of time, allowing for basic triage of many types of issues. Diagnostic information is collected at various intervals (ranging from 1min to 1hour, depending on the kind of data), and can be easily customized and expanded. The scripts will collect data only while the database is activated, which can help to reduce the generation of unnecessary redundant data.

This blog provides an overview of the new report generation script.


The new report generation script can be used to scrape previously generated db2histmon data collections and produce summary reports in multiple flavours, similar to the reports produced by the MONREPORT module. The reports are useful for analyzing various metrics, and for highlighting statistical outliers, from previous time periods.

Downloading the db2histmon report script:

The full set of db2histmon scripts can be downloaded from the open-source Github repo: https://github.com/IBM/db2histmon.
The report.py script itself is located in the 4_report/ sub-folder:

(db2inst1@myhost1) /home/db2inst1/db2histmon-master/
$ ls
 
1_setup/  2_loader/  3_quickparser/  4_report/

Pre-requisites:

  • The scripts can be deployed in any Unix or Windows environment where Db2 is supported.
  • Python 2 or higher.
  • The Python pandas, argparse, and numpy modules are also required to execute the report.py script.
Using pip to install the pandas, argparse, and numpy modules:
(db2inst1@myhost1) /home/db2inst1/
  $ pip install argparse
Collecting argparse
  Downloading argparse-1.4.0-py2.py3-none-any.whl (23 kB)
Installing collected packages: argparse
Successfully installed argparse-1.4.0
(db2inst1@myhost1) /home/db2inst1/
  $ pip install numpy
Collecting numpy
  Downloading numpy-1.19.4-cp38-cp38-win_amd64.whl (13.0 MB)
Installing collected packages: numpy
Successfully installed numpy-1.19.4
(db2inst1@myhost1) /home/db2inst1/
  $ pip install pandas
Collecting pandas
  Downloading pandas-1.1.4-cp38-cp38-win_amd64.whl (8.9 MB)
Collecting python-dateutil>=2.7.3
  Downloading python_dateutil-2.8.1-py2.py3-none-any.whl (227 kB)
Installing collected packages: python-dateutil, pandas
Successfully installed pandas-1.1.4 python-dateutil-2.8.1

Types of reports:

The report.py script is currently capable of generating three unique types of reports:

DBSUMMARY - Provides a summary of metrics for the entire database, similar to the MONREPORT.DBSUMMARY module.

CONNECTION - Provides a summary of metrics for each connection, similar to the MONREPORT.CONNECTION module.

PKGCACHE - Provides a summary of metrics for the top executed statements that are still cached within the package-cache, similar to the MONREPORT.PKGCACHE module.


A report will display various metrics and data points for every data collection time period (or limited only to the time periods specified in the arguments), as described later.


Historical data which is used to generate the report:

The report.py script will scrape historical monitoring data that was previously collected by the Db2 Historical Monitoring (db2histmon) framework. 

Please see the 'The Data Collection and Data Archive folders' section of my previous blog which describes the Db2 Historical Monitoring (db2histmon) framework, if you're unsure where this historic monitoring data may reside.

For reference, I'm including a screen-shot showing the contents of the db2histmon data collection folder:




 

Usage and options:

The report.py script has the following usage options:

(db2inst1@myhost1) /home/db2inst1/db2histmon-master/4_report/ 
   $ python report.py -h
 
usage: report.py [-h]  [-r {dbsummary,connection,pkgcache}]
                 [-st START_TIME] [-et END_TIME] [-p PERIOD]
                 [-s] [-m MEMBERS [MEMBERS ...]] 
                 [-ah APPLICATION_HANDLES [APPLICATION_HANDLES ...]] path

positional arguments:
  path                  path where collection directories are stored

optional arguments:
  -h, --help            show this help message and exit
  -r, --report {dbsummary,connection,pkgcache}
                        name of report to be generated
  -st START_TIME, --start_time START_TIME
                        display collections after this time, format can be (YYYY-mm-dd HH:MM:SS) or copied from a
                        collection (YYYY-mm-dd-HH-MM-SS.ffffff)
  -et END_TIME, --end_time END_TIME
                        display collections before this time, format can be (YYYY-mm-dd HH:MM:SS) or copied from a
                        collection (YYYY-mm-dd-HH-MM-SS.ffffff)
  -p PERIOD, --period PERIOD
                        display collections at a lesser frequency, for example an interval of 3 will show every third
                        collection
  -s, --stats           show min, max, mean, std for each series of values
  -m MEMBERS [MEMBERS ...], --members MEMBERS [MEMBERS ...]
                        filter certain reports by member
  -ah APPLICATION_HANDLES [APPLICATION_HANDLES ...], --application_handles APPLICATION_HANDLES [APPLICATION_HANDLES ...]
                        filter certain reports by application handle
 

Use the --report option to specify the type of report to generate, either DBSUMMARY, CONNECTION, or PKGCACHE.

The last argument specifies the folder path where the db2histmon historical monitoring data resides.

I'll explain the other options within the examples further below.
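For reference, the two timestamp layouts accepted by the --start_time and --end_time options can be checked with Python's datetime module. The format strings below are my own reconstruction of the layouts shown in the usage text above, not code taken from report.py:

```python
from datetime import datetime

# Hypothetical helper: the two timestamp layouts accepted by
# --start_time/--end_time, reconstructed from the usage text.
ARG_FORMATS = [
    "%Y-%m-%d %H:%M:%S",     # e.g. "2020-09-29 02:20:00"
    "%Y-%m-%d-%H-%M-%S.%f",  # e.g. "2020-09-29-02-20-00.123456", copied from a collection
]

def parse_boundary(ts):
    """Return a datetime for a --start_time/--end_time value, trying both layouts."""
    for fmt in ARG_FORMATS:
        try:
            return datetime.strptime(ts, fmt)
        except ValueError:
            continue
    raise ValueError("unrecognized timestamp: %r" % ts)
```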


Example 1. Generate a DBSUMMARY report (on all available historical monitoring data)


(db2inst1@myhost1) /home/db2inst1/db2histmon-master/4_report/
$ python report.py -r DBSUMMARY /path/to/db2histmon/data_files/
 





Example 2. Generate a report on a subset of time-intervals, by using the START_TIME and END_TIME options

In some cases, the quantity of db2histmon historical monitoring data collections available or archived is very large, which can make the reports too long to be readable.

We can use the --start_time and --end_time options to limit the data collection time-intervals that we wish to generate a report on.
In this example, we wish to include all data collections that were generated after 2020-09-29 02:20 but before 2020-09-29 04:30:

(db2inst1@myhost1) /home/db2inst1/db2histmon-master/4_report/
$ python report.py -r DBSUMMARY --start_time "2020-09-29 02:20:00" --end_time "2020-09-29 04:30:00" /path/to/db2histmon/data_files/
 




Example 3. Generate a report with time-intervals condensed into larger time PERIODs.

We can also use the --period option to combine time-intervals from the data collection set into larger sets, such that fewer (but longer duration) time-intervals are displayed in the report.
In this example, many of our original db2histmon historical monitoring data collections are approximately 5 minutes apart (as can be seen in the previous report example above). By specifying the --period 3 option, we aggregate the data that would normally be reported in three separate time-periods into one time-period:

(db2inst1@myhost1) /home/db2inst1/db2histmon-master/4_report/
$ python report.py -r DBSUMMARY --period 3  /path/to/db2histmon/data_files/
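Conceptually, the --period option folds every N consecutive collections into one reported interval. The small pandas sketch below is my own illustration of the idea; report.py's actual per-metric aggregation (sum versus mean, for example) may differ:

```python
import pandas as pd

def condense(values, period):
    """Group every `period` consecutive per-collection values into one
    interval, averaging within each group (illustrative only)."""
    s = pd.Series(values)
    return s.groupby(s.index // period).mean().tolist()

# Six 5-minute collections condensed into two 15-minute intervals:
condense([1, 2, 3, 4, 5, 6], 3)  # → [2.0, 5.0]
```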
 



Example 4. Generate a report showing MIN, MAX, MEAN statistical values.

Understanding the statistical values associated with a particular metric can be useful when looking for anomalies in the data set.  In this example, I specify the --stats option to include the statistics for each reported value:

(db2inst1@myhost1) /home/db2inst1/db2histmon-master/4_report/
$ python report.py -r DBSUMMARY --stats /path/to/db2histmon/data_files/
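The min, max, mean, and std values shown by --stats can be reproduced for any metric series with pandas. The helper below is my own sketch (it uses pandas' default sample standard deviation; the script's exact computation may differ):

```python
import pandas as pd

def series_stats(values):
    """Min, max, mean, and std for one metric across the reported intervals
    (illustrative sketch of what the --stats option displays)."""
    s = pd.Series(values)
    return {"min": s.min(), "max": s.max(), "mean": s.mean(), "std": s.std()}
```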




IBM Db2 - Technical (Lab) Advocate Newsletter - November 2020

 


IBM Db2 & Db2 on Cloud
Technical (Lab) Advocate Newsletter
November 2020


    Announcement - IBM Db2 Version 11.5 Modification Pack 5 is Now Available!

    Mod Pack Summary of Changes: https://ibm.co/3nHLIn7
    Db2 11.5.5.0 Download Page: https://www.ibm.com/support/pages/node/6368787
    Db2 11.5.5.0 Full APAR Fix List: https://www.ibm.com/support/pages/node/1102545
    Db2 11.5.5.0 Client Drivers Download Page: https://www.ibm.com/support/pages/node/323035

    Some Key Highlights:
    • Advanced Log Space Management - Improvements for mirrored logging support and monitoring.
    • Automated failover to HADR standby with Pacemaker (on Linux for on-premises and non-containerized cloud deployments) for production environments.
    • Support for Snowflake data sources through federation JDBC and ODBC connections.
    • Support for MySQL Community Edition data sources through a Federation ODBC connection.

    • Automatic Recompress feature enhancement for columnar tables.
    • pureScale - Automated periodic cluster topology verification, for daily cluster health checks.
    • pureScale - Improved Remote Direct Memory Access (RDMA) connectivity validation, using Db2 Lightspeed RDMA ping.
    • JDBC 4.3 specification API support; OpenJDK 13 runtime support; OData enhancement.
    • pureScale - Concurrent online fix pack update

    • Enhanced query performance against column organized tables.
    • Db2 Historical Monitoring (db2histmon) - generate MONREPORT-style reports on historical monitor data.
    • New schema level access control of database objects.

    • UPDATE, DELETE statement SKIP LOCKED Data Support for CDE Tables
    • Machine Learning Optimizer technology preview refresh
    and much more: https://ibm.co/3nHLIn7


    Webinar Series - Db2 Offering Management Webinar Series (including Db2 v11.5.5 Deep Dive)

    The Db2 Offering Management team provides a series of informative webinars covering Db2 roadmaps, strategies, new feature and capability overviews, and much more:
    • Unboxing Db2 11.5.5 - Get an overview of all features and enhancements in Db2 11.5.5.
    • Performance and Security Updates - Hear about the latest enhancements, including topics such as schema-level authorization, compression improvements, and compact varchar enhancements.
    • Db2 Modern Workloads - Hear about latest enhancements focused on supporting modern workloads, including: ability to handle mixed workloads in a high performing manner, integrated data virtualization, ability to leverage a multi-modal data engine including JSON and Graph including demos on Db2's NoSQL and NewSQL capabilities.
    • Db2 Containers and Cloud Pak for Data - Hear about the latest enhancements for containerized deployment, and why it makes business and technical sense to move to Db2 on CP4D.
    • Availability and Resiliency Updates - Hear about the latest enhancements in Availability and Recoverability, with the introduction of automated HADR with Pacemaker.
    • Db2 on Cloud - If you are considering Db2 Cloud services for your workloads, this session will cover both Db2 services - Db2 on Cloud (OLTP) and Db2 Warehouse on Cloud. We will also discuss the strategy and roadmap so this will be a great session if you are strategizing to move to Db2 on Cloud soon or are already on it.
    • Data Management Console for Db2 Administrators - Learn about the latest updates in the Data Management console to help Db2 Administrators manage Db2 environments.
    • New Packaging and Pricing - This session will cover the upcoming editions of Db2, its pricing and entitlement migration paths from previous versions to new versions.

    Register for access to recordings, as well as upcoming webinars:
    https://mailchi.mp/8613d131efe0/db2-nebula-webinar-landing-page


    Webinar - Db2 is Developer Inclusive

    In this session, the team will delve into topics such as: Integration with programming languages; Employing in-database machine learning; Leveraging REST APIs. The team will also provide a preview of Db2 11.5.5!

    When: 11th November, 11:00AM EST
    https://ibm.webcasts.com/starthere.jsp?ei=1392215&tp_key=2d29881a0c


    Webinar - Db2 is AI ready

    Please join Quentin Presley (Development Manager) and Roger Bitar (Offering Manager) as they detail how enterprises can become smarter with a data management system that is AI-ready. During this session, learn how Db2 augments, embeds, and infuses AI on a flexible and scalable architecture, offering governance to facilitate innovative data estates and infrastructure.
    https://ibm.webcasts.com/starthere.jsp?ei=1389272&tp_key=6e769cdbb6


    Problem Determination - Db2 Historical Monitoring (db2histmon) - New Report Generation

    To aid with problem determination and improve the availability of diagnostic and monitoring data at the first occurrence of a problematic event, the new Db2 Historical Monitoring (db2histmon) scripts and framework are now available. These scripts deploy a broad set of monitoring and diagnostic data collection, at varying intervals, and manage the archival and purging of historic data. Data collection sets can be customized as needed. A new report generation script can create MONREPORT-style reports based on historical monitoring data.
    Download the db2histmon scripts: https://github.com/IBM/db2histmon
    Learn more about deploying and using the db2histmon scripts: https://bit.ly/31uc580
    Learn more about the db2histmon report generation scripts: 
    https://bit.ly/36RMc3f


    Conferences & Summits & User Groups

    Over the years, IDUG has become known for hosting dynamic conferences packed with the cutting-edge content you need. This year will be no exception. Be sure to make your plans to attend. With great sessions, keynote speakers, workshops, and more, you won't want to miss this event.

    Upcoming Events: https://www.idug.org/events/calendar
    IDUG EMEA Db2 Tech Conference - November 16-19, 2020
    IDUG Latin America Virtual Db2 Tech Seminars - December 8-9, 2020
    IDUG Australasia Conference - March 15-19, 2020


    Upcoming Regional User Groups:
    Find Other Local Db2 User Groups in your area: https://www.idug.org/rug


    Webinar - New developments in Db2 pureScale

    Join Toby Haynes (Senior Manager, IBM Db2) as he provides an overview of recent enhancements to Db2 pureScale, improving on the already impressive extreme availability architecture.
    https://www.dbisoftware.com/blog/db2nightshow.php?id=800


    New Offering - Db2 on Cloud Standard and Enterprise Plans

    Db2 on Cloud offers new Standard and Enterprise plans that include the following features:
    • Independent scaling of compute and storage
    • Three high-availability nodes plan spanning multiple availability zones (for high-availability plans)
    • Self-service managed backup with point-in-time restore
    • Private endpoint connectivity
    All customers on legacy plans (Flex or Precise Performance) must move to a Standard or Enterprise plan. For more information, see the following blogs:
    https://www.ibm.com/cloud/blog/announcements/ibm-db2-on-cloud-elastic-plans
    https://www.ibm.com/cloud/blog/announcements/deprecation-of-the-db2-on-cloud-legacy-plans-and-availability-of-new-replacement-plans


    Webinar - Intro to Db2 Managed Service for Enterprises

    Learn about the managed Db2 service on cloud for OLTP workloads. With Managed Database as a Service, users can focus on applications, leaving the management to the service provider. IBM Db2 on Cloud Managed Service's robust features provide the depth to handle customer workload requirements such as high availability, scalability, on-demand capacity, and security.

    https://video.ibm.com/playlist/633288


    Webinar - Intro to Db2 Warehouse on Cloud

    Db2 Warehouse on Cloud is the elastic Software-as-a-Service (SaaS) offering from IBM that runs on multiple cloud platforms. Get to know the offering and learn about the new and upcoming features that make Db2 Warehouse on Cloud support larger warehouse environments with better Disaster Recovery (DR) capabilities.
    https://video.ibm.com/recorded/128072408


    Webinar - Db2 for SAP

    Learn why Db2 is best suited for SAP workloads, offering tight integration, accelerated performance, and reliability—all on a wide choice of platforms.
    https://video.ibm.com/playlist/633288/video/127806532


    Webinar - IBM Replication – Data Delivery in 2020

    IBM Gold Consultant and replication expert Frank Fillmore delivered an information packed presentation covering an overview of IBM replication solutions that are available today. Whether you are a current IBM replication user, investigating replication for a new edge system, or working with a competitor’s solution, this webinar will help you understand what IBM replication solutions support today, and the latest sources, targets and deployment models to help your organization successfully deliver fast and accurate replicated data.
    https://bit.ly/31gqKDw


    Webinar - Data and AI pre-configured: IBM Cloud Pak for Data System vs. the competition

    IBM Cloud Pak® for Data System, a preconfigured “cloud in a box,” helps companies modernize their data and quickly get AI projects up and running. Join experts from IBM and analyst firm Cabot Partners to learn why IBM Cloud Pak for Data System is a leader in hyper-converged platforms for data and AI. This webinar will walk through a comprehensive competitive analysis of IBM Cloud Pak for Data System against AWS Outposts, Google Anthos, Microsoft Azure Stack/Hybrid Cloud and Oracle Cloud at Customer.
    https://bit.ly/3dnDTxa


    _____________________________________________________________________________________________

    Support - Overview of IBM Support Framework, Guidelines, and Escalation methods
    Please review the IBM Support guidelines for a description of the IBM support framework; how to get set up for IBM support; methods to help yourself via Watson search; and how to escalate a support case.
    https://www.ibm.com/support/pages/ibm-support-guide

    Roadmaps - Db2 Development Roadmaps are Accessible to Everyone
    Curious about the features and functions committed to upcoming versions of Db2? You can now explore live development roadmaps for Db2 and the IBM Analytics family of products. This content is live and subject to change.
    https://ibm-analytics-roadmaps.mybluemix.net/

    Db2 Community - Follow the Db2 Developer and Administrator Community
    Share. Solve. Do More.
    https://developer.ibm.com/data/db2/

    Follow-us! - Stay up to date on the latest news from the Db2 team
    IBM Support Community: (https://www.ibm.com/mysupport/)
    IBM DB2 DeveloperWorks (http://bit.ly/sig_Blog)
    IBM DB2 Twitter page (https://twitter.com/db2_support)
    IBM Youtube Channel (http://bit.ly/sig_YouTube)
    developerWorks Answers (forum): https://developer.ibm.com/answers/topics/



Tuesday 30 June 2020

IBM Db2 - Technical (Lab) Advocate Newsletter - June 2020

IBM Db2 & Db2 on Cloud
Technical (Lab) Advocate Newsletter
June 2020



    Announcement - IBM Db2 Version 11.5 Modification Pack 4 ('Nebula') is Now Available!

    Mod Pack Summary of Changes: https://ibm.co/2NHllxv
    Db2 11.5.4.0 Download Page: https://www.ibm.com/support/pages/node/6241724
    Db2 11.5.4.0 Full APAR Fix List: https://www.ibm.com/support/pages/node/1102545
    Db2 11.5.4.0 Client Drivers Download Page: https://www.ibm.com/support/pages/node/323035

    Some Key Highlights:
    • Db2 REST API 
    • Automatic INDOUBT Resolution for DPF 
    • Adaptive workload management for WORKLOAD=ANALYTICS 
    • CPU Controls
    • Columnar Page based VARCHAR compression 
    • Query Performance (early aggregation; early distinct; full outer join; join residual predicate support; NULL=NULL)
    • Memory Stability (COMPACT VARCHAR; Full Outer Join) 
    • Advanced Log Space Management 
    • Ability to block operations which put table in reorg-pending state
    • Optimizer Version Control 
    • Enhance Support for Unicode Data in a Non-Unicode Database 
    • Faster Database Activation 
    • Queries can skip locked data
    • SSO with JWT Token 
    • Authentication Caching 
    • Security-Enhanced (SE) Linux support on RHEL 7 & 8 
    • Enhancement to improve compression for numeric data stored in string data types in columnar tables
    • Federation: Parallelism ; Numerous JDBC connectors
    • Tech Previews: Graph Database; ML Optimizer; Automated HADR with Pacemaker
    • Containerized Deployments: Db2U on RedHat OpenShift standalone for both Db2 and Db2 Warehouse; Db2 Community standalone docker container on DockerHub (non-production)
    • And much more: https://ibm.co/2NHllxv

    Webinar Series - Db2 Offering Management Webinar Series (including Db2 v11.5.4 Deep Dive)

    The Db2 Offering Management team provides a series of informative webinars covering Db2 roadmaps, strategies, new feature and capability overviews, and much more:
    • Overview of Db2 Editions and Entitlement migration paths from older editions to new.
    • Overview of Db2's strategic direction and future Roadmap.
    • Overview of Db2's Containerization Strategy for Docker and Red Hat Open Shift.
    • Overview of Db2 REST Service, Python UDF and Drivers.
    • Deep Dive into Db2 v11.5.4 Nebula release. We will go through exciting new features.
    • Deep Dive into Db2 Graph Capabilities.
    • Deep Dive into Db2's Machine Learning capabilities.
    • Deep Dive into Db2's new Enterprise readiness capabilities.

    Register for access to recordings, as well as upcoming webinars:
    https://mailchi.mp/aee32565e7c6/db2-nebula-webinar-landing-page


    Announcement - IBM Db2 on Cloud - Enterprise Plan

    Whether you are shifting workloads to cloud or augmenting your on-premises business, IBM Db2 on Cloud has never been more ready to take on your toughest, mission critical workloads.  Starting June 24, 2020, you can purchase the brand-new Enterprise plan. Some of the key features are...
    • Online scaling of storage and compute.
    • Zero downtime maintenance.
    • Leverages Multi-Zone Region (MZR) support for 99.99% high availability!
    • 10TB of free backup storage for up to 14 days of backups
    • Self-service managed backup with point-in-time restore
    (Watch for news about our upcoming Standard plan, available later this summer.)
    https://www.ibm.com/cloud/blog/announcements/db2-on-cloud-announces-new-enterprise-high-availability-plan


    Webinar - The new face of Db2: Cloud-Native and AI ready

    Piotr Mierzejewski (Director, Db2) described the future directions in Db2 11.5 for AI and the cloud. There is a lot to unpack from this presentation. A must-see!
    https://www.dbisoftware.com/blog/db2nightshow.php?id=805


    Problem Determination - Enabling Db2 Historical Monitoring (db2histmon) - Continuous diagnostic data collection and archival

    To aid with problem determination and improve the availability of diagnostic and monitoring data at the first occurrence of a problematic event, the new Db2 Historical Monitoring (db2histmon) scripts and framework are now available. These scripts deploy a broad set of monitoring and diagnostic data collection, at varying intervals, and manage the archival and purging of historic data. Data collection sets can be customized as needed.
    Download the db2histmon scripts: https://github.com/IBM/db2histmon
    Learn more about deploying and using the db2histmon scripts: https://bit.ly/31uc580


    Blog - How-To: Running HA IBM DB2 on Kubernetes

    IBM Cloud Kubernetes Service is a managed Kubernetes offering to deliver powerful management tools, an intuitive user experience, and built-in security and isolation to enable rapid delivery of applications—all while leveraging IBM Cloud Services, including cognitive capabilities from Watson. The aim of this blog post is to demonstrate a Db2 deployment architecture on Kubernetes using Portworx persistent volumes. Following this post, you’ll understand how to run Db2 in production on Kubernetes.
    https://www.ibm.com/cloud/blog/how-to-running-ha-ibm-db2-on-kubernetes


    Webinar - Db2 Resiliency Models and HA Best Practices

    Join Dale McInnis (Executive Technical Specialist), as he provides an overview of Db2 resiliency and high availability best practices.
    https://www.dbisoftware.com/blog/db2nightshow.php?id=796


    Hands-on Lab - IBM Db2 on OpenShift

    With the move to DevOps and Agile frameworks and the shift towards "hybrid" in public and private cloud, platforms such as IBM Red Hat OpenShift are becoming the standard in application delivery. This lab teaches you how to deploy and work with Db2 in an OpenShift environment for both OLTP and MPP deployments. The lab includes how to deploy Db2 applications on Node.js and the use of Db2 External Tables in OpenShift.
    https://www.ibm.com/cloud/garage/dte/tutorial/db2-openshift-hands-lab/


    Conferences & Summits & User Groups

    Over the years, IDUG has become known for hosting dynamic conferences packed with the cutting-edge content you need. This year will be no exception. Be sure to make your plans to attend. With great sessions, keynote speakers, workshops, and more, you won't want to miss this event.

    IDUG Db2 Virtual Tech Conference July 20, 21 & 23: https://www.idug.org/p/cm/ld/fid=2059

    Upcoming Regional User Groups:

    Find Other Local Db2 User Groups in your area: https://www.idug.org/rug


    Webinar - Data Security for Database Administrators

    Today, data is the cornerstone of any business. As a result, database teams are facing a number of challenges, including the increasing size and complexity of workloads, moving data and applications to the cloud, and migrating legacy technologies to cloud-based/cloud-native architectures. To successfully navigate these trends, organizations must account for new operational risks inherent in hybrid multicloud data environments and view data security as an essential aspect of modern database administration. In this session we will discuss the core data security use cases that database management teams need to consider as they build next-generation database platforms.
    https://bit.ly/2B7mgo2


    _____________________________________________________________________________________________

    Support - Overview of IBM Support Framework, Guidelines, and Escalation methods
    Please review the IBM Support guidelines for a description of the IBM support framework; how to get set up for IBM support; methods to help yourself via Watson search; and how to escalate a support case.
    https://www.ibm.com/support/pages/ibm-support-guide

    Roadmaps - Db2 Development Roadmaps are Accessible to Everyone
    Curious about the features and functions committed to upcoming versions of Db2? You can now explore live development roadmaps for Db2 and the IBM Analytics family of products. This content is live and subject to change.
    https://ibm-analytics-roadmaps.mybluemix.net/

    Db2 Community - Follow the Db2 Developer and Administrator Community
    Share. Solve. Do More.
    https://developer.ibm.com/data/db2/

    Follow-us! - Stay up to date on the latest news from the Db2 team
    IBM Support Community: (https://www.ibm.com/mysupport/)
    IBM DB2 Developerworks (http://bit.ly/sig_Blog)
    IBM DB2 Chat Bubble (http://bit.ly/sig_forums)
    IBM DB2 Twitter page (https://twitter.com/db2_support)
    IBM DB2 Facebook page (https://www.facebook.com/DB2Support)
    IBM Youtube Channel (http://bit.ly/sig_YouTube)
    developerWorks Answers (forum): https://developer.ibm.com/answers/topics/

Enabling Db2 Historical Monitoring (db2histmon) - continuous diagnostic data collection and archival

- David Sciaraffa, Software Engineering Manager – IBM Db2


When a problem with Db2 arises, identification of the problem's root cause by IBM Db2 support and development is often directly influenced by the availability of diagnostic information before, during, and after the time-frame of the problem event. When the diagnostic messages in the db2diag.log are insufficient, it is not uncommon for IBM Db2 support or development to provide instructions to enable a specific set of diagnostic data, and wait for a re-occurrence of the problem, in order to begin narrowing the root cause. 

The new Db2 Historical Monitoring (db2histmon) scripts (available here) collect a broad range of Db2 and Operating System diagnostic data, and retain this data for a period of time, allowing for basic triage of many types of issues. Diagnostic information is collected at various intervals (ranging from 1min to 1hour, depending on the kind of data), and can be easily customized and expanded. The scripts will collect data only while the database is activated, which can help to reduce the generation of unnecessary redundant data.

By enabling the collection of Db2 Historical Monitoring (db2histmon), you can improve the odds that helpful diagnostic information is available after the first occurrence of a problem.

This blog provides an overview of how to enable and customize the new Db2 Historical Monitoring (db2histmon) scripts.



Downloading the Db2 Historical Monitoring (db2histmon) scripts:

The scripts are available and can be downloaded from the open-source Github repo: https://github.com/IBM/db2histmon


Pre-requisites:

  • The scripts can be deployed in any Unix or Windows environment where Db2 is supported.
  • Python 2 or higher, as well as the Python ibm_db module, is required to execute the scripts.
  • The scripts can be deployed on any relatively modern version of Db2 (Version 10.1, 10.5, 11.1, 11.5). 
  • Db2 DBADM authority is required on the database where the historical monitoring will be deployed.
  • A C++ compiler (for example g++ on Unix or Visual Studio on Windows) is required (in order for the scripts to use bldrtn to compile the external C++ UDFs which are used by the historical monitoring framework).

Overview of the Db2 Historical Monitoring framework:

The historical monitoring scripts utilize a set of control tables, procedures, and external UDFs, with the Db2 Admin Task Scheduler executing the data collection tasks at the desired frequency (minimum 1min). Diagnostic/monitoring output is generated into the active data collection folder first, and is then archived into the data archival folder every hour.

Consider the following high-level process flow diagram:



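The hourly archival step can be pictured as moving aged files from the active collection folder into the archive folder. The Python sketch below is purely illustrative; in the real framework this is performed by the proc_archive.sql stored procedure under the Admin Task Scheduler, not by a Python loop:

```python
import os
import shutil
import time

def archive_old_collections(collect_dir, archive_dir, max_age_seconds=3600):
    """Illustrative only: move files older than max_age_seconds from the
    active data collection folder into the archive folder, returning the
    names of the files that were moved."""
    os.makedirs(archive_dir, exist_ok=True)
    now = time.time()
    moved = []
    for name in sorted(os.listdir(collect_dir)):
        path = os.path.join(collect_dir, name)
        if os.path.isfile(path) and now - os.path.getmtime(path) > max_age_seconds:
            shutil.move(path, os.path.join(archive_dir, name))
            moved.append(name)
    return moved
```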
Setup Folder:

After downloading the scripts to your local database server system, you'll notice three folders, 1_setup, 2_loader, and 3_quickparser:

(db2inst1@hotellnx112) /home/db2inst1/db2histmon-master/
$ ls
 
1_setup/  2_loader/   3_quickparser/


For the purpose of this blog, we'll be focusing on the 1_setup/ folder (which is used to setup and deploy the historical monitoring framework and begin data collection). In a subsequent blog I will provide an overview of the 2_loader script (which is used to easily load the data collections into a database so it can be queried), and the 3_quickparser script (which is used to display the data collections to a terminal window in a columnar format for easier viewing).

Within the 1_setup/ folder, we find three files and a sub-folder named sql/:

(db2inst1@hotellnx112) /home/db2inst1/db2histmon-master/1_setup/
$ ls
 
README.md  setup.py  sql/  task_details.json

  • The README.md file contains a technical description of the setup scripts, input options, as well as a brief architectural overview of how the scripts work. The purpose of this blog is to expand on this information with some visual examples.

  • The setup.py file is the main setup script, described in detail below.

  • The task_details.json file, described in detail below, contains the definition of each data collection task, and is used by the setup.py script to deploy each task.
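Because task_details.json is plain JSON, the set of tasks to be deployed can be inspected before running setup.py. The summarize_tasks helper below is my own sketch; the field names match the default file shipped with the scripts:

```python
import json

def summarize_tasks(path):
    """List each task's name, class (SQL or SYS), and cron-style
    frequency from a task_details.json file (illustrative helper)."""
    with open(path) as f:
        tasks = json.load(f)
    return [(t["collection_name"], t["collection_class"], t["collection_freq"])
            for t in tasks]
```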

Within the 1_setup/sql/ folder, we find the definitions of the control tables, procedures, and user-defined functions which the setup.py script will create in the specified database. The details of these files are mostly of interest to developers or administrators who wish to explore and extend the framework, so I do not cover them in detail in this blog.

(db2inst1@hotellnx112) /home/db2inst1/db2histmon-master/1_setup/sql
$ ls
 
external.C
external.def
external.exp
external.h
external.sql
proc_archive.sql
proc_collect.sql
tab_config.sql
tab_dirs.sql
tab_errs.sql
tab_tasks.sql


Overview of the task_details.json file:

The task_details.json file defines each data collection task, and is used by the setup.py script to deploy each data collection task into the Db2 Admin Task Scheduler.

The default task_details.json file that is included with the scripts looks like so:

(db2inst1@hotellnx112) /home/db2inst1/db2histmon-master/1_setup/
$ less task_details.json

[
    {
      "collection_name": "ENV_GET_SYSTEM_RESOURCES",
      "collection_class": "SQL",
      "collection_command": "SELECT CURRENT TIMESTAMP AS COLLECTION_TIME, T.* FROM TABLE(SYSPROC.ENV_GET_SYSTEM_RESOURCES()) T",
      "collection_freq": "* * * * *",
      "collection_level": "1",
      "collection_condition": "",
      "loader_join_columns": "",
      "loader_diff_exempt_columns": "ALL",
      "quickparse_summary_columns": "ALL"
    },

    ...etc...
        ...etc...

    {
      "collection_name": "VMSTAT",
      "collection_class": "SYS",
      "collection_command": "vmstat",
      "collection_freq": "* * * * *",
      "collection_level": "1",
      "collection_condition": "UNIX",
      "loader_join_columns": "",
      "loader_diff_exempt_columns": "",
      "quickparse_summary_columns": ""
    }
]


  • The "collection_name" defines the name of the collection task (this name will be used later to generate the filename into which this data collection will be output, so avoid using spaces or special characters).
  • The "collection_class" can be either "SYS" or "SQL". If the task is defined as "SYS", the collection_command is invoked as a system call to the operating system. If the task is defined as "SQL", the collection_command is invoked as an SQL query against the database.
  • The "collection_command" is the operating system command or SQL query which generates the desired monitoring or diagnostic information.
  • The "collection_freq" defines the frequency of data collection, and this is specified in cron format. The maximum frequency is every 1 minute. 
  • The "collection_level" can be either "1" or "2".  When the setup.py script is executed to deploy these data collection tasks to the database, the administrator has the option of deploying only a small set of basic data collection (which are those tasks defined with a collection_level value of 1), or an expanded set of data collection (which are those tasks defined with a collection_level value of 2).
  • The "collection_condition" defines the environmental condition under which the data collection task should run. As of this writing, the possible values are "HADR", "PURESCALE", "UNIX", "WINDOWS", or empty. For example, if a task is defined with a collection_condition value of "HADR", the task is only executed if a Db2 HADR environment is detected. If the collection_condition is empty, the task is always executed.
  • The "loader_join_columns", "loader_diff_exempt_columns", and "quickparse_summary_columns" will be the focus of a subsequent blog.
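To illustrate how these fields interact, the sketch below (my own simplified logic, not the actual setup.py implementation) loads a trimmed-down copy of task_details.json, keeps tasks at or below a requested collection level, and filters on collection_condition against the detected environment:

```python
import json

# A trimmed, hypothetical subset of task_details.json (full entries carry
# additional fields such as collection_command and loader_join_columns).
TASKS_JSON = """
[
    {"collection_name": "ENV_GET_SYSTEM_RESOURCES", "collection_class": "SQL",
     "collection_freq": "* * * * *", "collection_level": "1", "collection_condition": ""},
    {"collection_name": "VMSTAT", "collection_class": "SYS",
     "collection_freq": "* * * * *", "collection_level": "1", "collection_condition": "UNIX"},
    {"collection_name": "MON_GET_GROUP_BUFFERPOOL", "collection_class": "SQL",
     "collection_freq": "* * * * *", "collection_level": "2", "collection_condition": "PURESCALE"}
]
"""

def select_tasks(tasks, level, environment):
    """Return the names of tasks eligible for deployment at the given
    collection level in the given environment (a set such as {"UNIX", "HADR"})."""
    selected = []
    for t in tasks:
        if int(t["collection_level"]) > level:
            continue  # task requires a higher collection level
        cond = t["collection_condition"]
        if cond and cond not in environment:
            continue  # environmental condition not met
        selected.append(t["collection_name"])
    return selected

tasks = json.loads(TASKS_JSON)
print(select_tasks(tasks, level=1, environment={"UNIX"}))
# ['ENV_GET_SYSTEM_RESOURCES', 'VMSTAT']
```

With level 1 on a Unix host, the pureScale-only level-2 task is skipped, mirroring the behavior the bullet points describe.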

setup.py usage and arguments:

Use the setup.py --help option to display usage:

(db2inst1@hotellnx112) /home/db2inst1/db2histmon-master/1_setup/
$ python3 setup.py --help

usage: setup.py [-h] [-un] [-pw] [-c] [-uc] [-ut] [-bp] [-cp] [-ap] [-ms]
                [-acmd] [-aext] [-lvl]
                database

Setup IBMHIST monitoring program. View README for more information.

positional arguments:
  database              name of database

optional arguments:
 -h, --help           show this help message and exit
 -un , --username     username used to connect to database, default is that
                      of current user
 -pw , --password     password used to connect to database, default is that
                      of current user
 -c, --cleanup        will drop the IBMHIST schema and all of its objects,
                      and unschedule its tasks
 -uc, --update_config_only
                      will update config values to passed in arguments or
                      reset to defaults for those not explicitly specified
 -ut, --update_tasks_only
                      will unschedule all old tasks and reschedule to match
                      with the task_details.json file
 -bp , --bldrtn_path  specify path for bldrtn script if it is not
                      automatically found, default:
                      /home/<current_user>/sqllib/samples/cpp/bldrtn
 -cp , --coll_path    directory path of data collection, default:
                      /home/<current user>/sqllib/db2dump
 -ap , --arch_path    directory path of data archival, default:
                      /home/<current user>/sqllib/db2dump
 -ms , --max_size     max size in bytes of collection and archival, default:
                      1073741824
 -acmd , --arch_cmd   command used to archive hourly folders with "_src_"
                      and "_dest_" placeholders, default: tar -caf _dest_
                      _src_
 -aext , --arch_ext   extension of archived hourly folders beginning with
                      ".", default: .tar.gz
 -lvl , --coll_lvl    scope of tasks to schedule (1 will only schedule key
                      tasks, 2 will schedule more), all tasks scheduled if
                      unspecified

  • The [database] name parameter is mandatory and specifies the local database against which the historical monitoring setup will be deployed.  If you have multiple databases in your instance and wish to deploy historical monitoring on these other databases, you must run the setup.py script against each one separately.
  • The --username and --password arguments are used to establish the database connection. These are required if your database is configured to disallow local user IPC connections without a password. (If unspecified, a connection to the database using the current user is used).
  • The --cleanup option performs a full removal of all previously created IBMHIST control tables, procedures, external UDFs, and data collection tasks. Any previously collected data files or archived sub-folders are not removed.
  • The --update_config_only option will perform only the creation (or re-creation) of the IBMHIST control tables, procedures, and external UDFs, and configure (or re-configure) any options/arguments that are specified. Note that default values are used for any options/arguments which are not specified, even if these arguments were specified during a previous invocation of the script.
  • The --update_tasks_only option will perform only the removal of any previously deployed data collection tasks, then parse the task_details.json file and (re)deploy all data collection tasks.
    (Note: when neither the --update_config_only nor the --update_tasks_only option is specified, both operations are performed.)
  • The --bldrtn_path optional argument specifies the fully qualified path to the sqllib/samples/cpp/bldrtn executable. This is required to compile the external C++ UDFs which perform the collection of operating system 'SYS' data collection tasks. (If unspecified, the default path /home/<current_user>/sqllib/samples/cpp/bldrtn is used).
  • The --coll_path optional argument specifies the fully qualified path where a data collection sub-folder will be created (every hour) and the monitoring/diagnostic data will be placed into. (If unspecified, the default path /home/<current_user>/sqllib/db2dump/ is used).
  • The --arch_path optional argument specifies the fully qualified path to the folder where the data collection folder (every hour) will be moved after it is removed from the active coll_path. (If unspecified, the default path /home/<current_user>/sqllib/db2dump/ is used).
  • The --max_size optional argument specifies the maximum size (in bytes) of archival data to retain within the arch_path. When this limit is breached, hourly data collection folders (starting with the oldest) will be deleted. (If unspecified, the default value 1073741824 bytes (1GB) is used).
  • The --arch_cmd optional argument specifies the operating system command that is used (every hour) to tar/zip/compress the active data collection folder into the archive folder. A filename (in the format '<dbname>_<yyyy><mm><dd><hh>') is automatically generated and should not be specified. Tokens '_src_' and '_dest_' are used as placeholders in place of the coll_path and arch_path respectively. (If unspecified, the default command 'tar -caf _dest_ _src_' is assumed). For example, if one wanted to use the star unix command, instead of the default tar command, one might specify 'star -c -f=_dest_ _src_'.
  • The --arch_ext optional argument specifies the extension to tack onto the archived folder's file after the arch_cmd is completed. (If unspecified, the default extension ".tar.gz" is used).
  • The --coll_lvl optional argument specifies the level of data collection. If "1" is specified, only the data collection tasks in the task_details.json file with a "collection_level" value of 1 are deployed.  If "2" is specified, tasks with a "collection_level" value of 1 or 2 are deployed. (If unspecified, all tasks are scheduled.)
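To make the _src_/_dest_ placeholder mechanics concrete, here is a small sketch (illustrative only, not the framework's actual code) that expands an --arch_cmd template using the documented '<dbname>_<yyyy><mm><dd><hh>' archive naming convention:

```python
# Hypothetical helper: substitute _src_ and _dest_ in an --arch_cmd template.
def build_archive_command(arch_cmd, coll_dir, arch_dir, dbname, hour_stamp, arch_ext):
    """The destination filename follows the documented
    '<dbname>_<yyyy><mm><dd><hh>' convention plus the archive extension."""
    dest = f"{arch_dir}/{dbname}_{hour_stamp}{arch_ext}"
    return arch_cmd.replace("_src_", coll_dir).replace("_dest_", dest)

cmd = build_archive_command(
    "tar -caf _dest_ _src_",  # the default --arch_cmd
    "/home/db2inst1/sqllib/db2dump/IBMHIST_TESTDB1/TESTDB1_2020062714",
    "/home/db2inst1/sqllib/db2dump/IBMHIST_TESTDB1_archive",
    "TESTDB1", "2020062714", ".tar.gz")
print(cmd)
# tar -caf /home/db2inst1/sqllib/db2dump/IBMHIST_TESTDB1_archive/TESTDB1_2020062714.tar.gz /home/db2inst1/sqllib/db2dump/IBMHIST_TESTDB1/TESTDB1_2020062714
```

Substituting a different template, such as the star example 'star -c -f=_dest_ _src_', works the same way, since only the two placeholder tokens are replaced.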


Example of running the setup.py script:

Here, we execute the setup.py script against database TESTDB1, using all default arguments:

<db2inst1@hotellnx112> /home/db2inst1/db2histmon-master/1_setup/
$ python3 setup.py testdb1

First, we see the creation of the SYSTOOLSPACE table space (if not already existing), and the determination of environment conditions (UNIX/WINDOWS, HADR, and/or PURESCALE):

Connecting to database: TESTDB1
Creating tablespace SYSTOOLSPACE ...
Determining environment ...
    Determining if Windows or Unix ...
        Unix detected ...
    Determining if pureScale ...
        pureScale not detected ...
    Determining if HADR ...
        HADR not detected ...

Next we see the unscheduling of any previously scheduled tasks:
 
Unscheduling tasks ...

Next we see the creation (or recreation) of the IBMHIST schema, the creation of the IBMHIST control tables and procedures, and the compilation of the external UDFs:

Setting up IBMHIST schema and its objects ...
    Creating IBMHIST schema ...
    Setting schema to IBMHIST ...
    Registering external functions ...
        Copying bldrtn script from '/home/db2inst1/sqllib/samples/cpp/bldrtn' to sql/ ...
        Building external functions ...
        Executing external.sql ...
    Registering tables ...
        Executing tab_config.sql ...
        Executing tab_errs.sql ...
        Executing tab_tasks.sql ...
        Executing tab_dirs.sql ...
    Registering procedures ...
        Executing proc_collect.sql ...
        Executing proc_archive.sql ...
    Removing additional files created during setup ...
 
Next we see the configuration of settings based on setup.py input arguments (or default values as in this example):
 
Configuring IBMHIST settings ...
    Deleting configurations from IBMHIST.TAB_CONFIG ...
    Setting COLL_PATH to '/home/db2inst1/sqllib/db2dump' ...
    Setting ARCH_PATH to '/home/db2inst1/sqllib/db2dump' ...
    Setting MAX_SIZE to '1073741824' ...
    Setting ARCH_CMD to 'tar -caf _dest_ _src_' and ARCH_EXT to '.tar.gz' ...
    Setting TASK_DETAILS_PATH to '/home/hotellnx112/db2inst1/db2histmon-master/1_setup/task_details.json' ...

 Next we see some validation tests and sanity checks:
 
Testing IBMHIST functionality ...
    Testing IBMHIST.PROC_COLLECT with a dummy SQL task ...
    Testing IBMHIST.PROC_COLLECT with a dummy SYS task ...
    Testing IBMHIST.PROC_ARCHIVE to archive collection directories ...
    Testing IBMHIST.PROC_ARCHIVE to delete archival directories once max size is reached ...

Next we see the parsing of the task_details.json file, and the registration of data collection tasks to the Db2 Admin Task Scheduler:
 
Scheduling tasks ...
    Setting Admin Task Scheduler registry variable ...
    Reading from task_details.json file ...
    Scheduling task: ENV_GET_SYSTEM_RESOURCES
    Scheduling task: MON_GET_AGENT
    Scheduling task: MON_GET_CONNECTION
    Scheduling task: MON_GET_ACTIVITY
    Scheduling task: MON_GET_UNIT_OF_WORK
    Scheduling task: MON_CURRENT_SQL
    Scheduling task: MON_GET_APPL_LOCKWAIT
    Scheduling task: MON_GET_EXTENDED_LATCH_WAIT
    Scheduling task: MON_GET_MEMORY_POOL
    Scheduling task: MON_GET_MEMORY_SET
    Scheduling task: MON_GET_TRANSACTION_LOG
    Scheduling task: DB_GET_CFG
    Scheduling task: DBMCFG
    Scheduling task: ENV_INST_INFO
    Scheduling task: ENV_GET_REG_VARIABLES
    Scheduling task: MON_GET_EXTENT_MOVEMENT_STATUS
    Scheduling task: MON_GET_GROUP_BUFFERPOOL
    Scheduling task: MON_GET_INDEX
    Scheduling task: MON_GET_LATCH
    Scheduling task: MON_GET_BUFFERPOOL
    Scheduling task: MON_GET_PAGE_ACCESS_INFO
    Scheduling task: MON_GET_PKG_CACHE_STMT
    Scheduling task: MON_GET_REBALANCE_STATUS
    Scheduling task: MON_GET_SERVERLIST
    Scheduling task: MON_GET_TABLE
    Scheduling task: MON_GET_TABLESPACE
    Scheduling task: MON_GET_UTILITY
    Scheduling task: MON_GET_WORKLOAD
    Scheduling task: VMSTAT
    Scheduling task: IOSTAT
    Scheduling task: NETSTAT
    Scheduling task: ARCHIVE
    Collection will begin in approximately 5 minutes ... 
 
Finally, the setup script is complete. Within about 5 minutes the Db2 Admin Task Scheduler should begin executing the data collection tasks at their defined frequencies.
 
Closing connection ...
Done


The Data Collection and Data Archive folders:

Approximately 5mins after executing the setup script, the Db2 Admin Task Scheduler should begin executing the data collection tasks, at their defined collection frequency.

Because I did not explicitly specify a --coll_path for the data collection, the default path within the database's DIAGPATH (usually ~/sqllib/db2dump/) is used, and a folder "IBMHIST_<dbname>" is created. Likewise for the --arch_path, a folder named "IBMHIST_<dbname>_archive" is created:

<db2inst1@hotellnx112> /home/db2inst1/sqllib/db2dump/
$ ls
 
db2ats_sps_bnd.log  
db2diag.log   
db2inst1.nfy  
IBMHIST_TESTDB1/       
IBMHIST_TESTDB1_archive/
stmmlog

Within the data collection folder, we can see that an hourly data collection sub-folder has been created:
 
<db2inst1@hotellnx112> /home/db2inst1/sqllib/db2dump/IBMHIST_TESTDB1/
$ ls
 
TESTDB1_2020062714/
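The hourly folder name encodes the database name and hour in the '<dbname>_<yyyy><mm><dd><hh>' format. A small illustrative helper (my own, not part of the scripts) can split it back apart:

```python
import datetime

def parse_hourly_folder(name):
    """Split a folder name like 'TESTDB1_2020062714/' into the database
    name and the collection hour it covers."""
    dbname, stamp = name.rstrip("/").rsplit("_", 1)
    hour = datetime.datetime.strptime(stamp, "%Y%m%d%H")
    return dbname, hour

db, hour = parse_hourly_folder("TESTDB1_2020062714/")
print(db, hour)
# TESTDB1 2020-06-27 14:00:00
```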

And within this hourly data collection sub-folder, we see the output of each of the data collection tasks, at their defined frequencies.  In this example, I listed the directory contents approximately 9mins after the Db2 Admin Task Scheduler began executing the tasks:

<db2inst1@hotellnx112> /home/db2inst1/sqllib/db2dump/IBMHIST_TESTDB1/TESTDB1_2020062714/
$ ls
 
ENV_GET_SYSTEM_RESOURCES_202006271421.del        MON_GET_MEMORY_SET_202006271421.del
ENV_GET_SYSTEM_RESOURCES_202006271422.del        MON_GET_MEMORY_SET_202006271422.del
ENV_GET_SYSTEM_RESOURCES_202006271423.del        MON_GET_MEMORY_SET_202006271423.del
ENV_GET_SYSTEM_RESOURCES_202006271424.del        MON_GET_MEMORY_SET_202006271424.del
ENV_GET_SYSTEM_RESOURCES_202006271425.del        MON_GET_MEMORY_SET_202006271425.del
ENV_GET_SYSTEM_RESOURCES_202006271426.del        MON_GET_MEMORY_SET_202006271426.del
ENV_GET_SYSTEM_RESOURCES_202006271427.del        MON_GET_MEMORY_SET_202006271427.del
ENV_GET_SYSTEM_RESOURCES_202006271428.del        MON_GET_MEMORY_SET_202006271428.del
ENV_GET_SYSTEM_RESOURCES_202006271429.del        MON_GET_MEMORY_SET_202006271429.del
IOSTAT_202006271421.del                          MON_GET_PAGE_ACCESS_INFO_202006271421.del
IOSTAT_202006271422.del                          MON_GET_PAGE_ACCESS_INFO_202006271422.del
IOSTAT_202006271423.del                          MON_GET_PAGE_ACCESS_INFO_202006271423.del
IOSTAT_202006271424.del                          MON_GET_PAGE_ACCESS_INFO_202006271424.del
IOSTAT_202006271425.del                          MON_GET_PAGE_ACCESS_INFO_202006271425.del
IOSTAT_202006271426.del                          MON_GET_PAGE_ACCESS_INFO_202006271426.del
IOSTAT_202006271427.del                          MON_GET_PAGE_ACCESS_INFO_202006271427.del
IOSTAT_202006271428.del                          MON_GET_PAGE_ACCESS_INFO_202006271428.del
IOSTAT_202006271429.del                          MON_GET_PAGE_ACCESS_INFO_202006271429.del
MON_CURRENT_SQL_202006271421.del                 MON_GET_REBALANCE_STATUS_202006271421.del
MON_CURRENT_SQL_202006271424.del                 MON_GET_REBALANCE_STATUS_202006271422.del
MON_CURRENT_SQL_202006271427.del                 MON_GET_REBALANCE_STATUS_202006271423.del
MON_GET_ACTIVITY_202006271422.del                MON_GET_REBALANCE_STATUS_202006271424.del
MON_GET_ACTIVITY_202006271424.del                MON_GET_REBALANCE_STATUS_202006271425.del
MON_GET_ACTIVITY_202006271426.del                MON_GET_REBALANCE_STATUS_202006271426.del
MON_GET_ACTIVITY_202006271428.del                MON_GET_REBALANCE_STATUS_202006271427.del
MON_GET_AGENT_202006271422.del                   MON_GET_REBALANCE_STATUS_202006271428.del
MON_GET_AGENT_202006271424.del                   MON_GET_REBALANCE_STATUS_202006271429.del
MON_GET_AGENT_202006271426.del                   MON_GET_SERVERLIST_202006271421.del
MON_GET_AGENT_202006271428.del                   MON_GET_SERVERLIST_202006271422.del
MON_GET_APPL_LOCKWAIT_202006271421.del           MON_GET_SERVERLIST_202006271423.del
MON_GET_APPL_LOCKWAIT_202006271422.del           MON_GET_SERVERLIST_202006271424.del
MON_GET_APPL_LOCKWAIT_202006271423.del           MON_GET_SERVERLIST_202006271425.del
MON_GET_APPL_LOCKWAIT_202006271424.del           MON_GET_SERVERLIST_202006271426.del
MON_GET_APPL_LOCKWAIT_202006271425.del           MON_GET_SERVERLIST_202006271427.del
MON_GET_APPL_LOCKWAIT_202006271426.del           MON_GET_SERVERLIST_202006271428.del
MON_GET_APPL_LOCKWAIT_202006271427.del           MON_GET_SERVERLIST_202006271429.del
MON_GET_APPL_LOCKWAIT_202006271428.del           MON_GET_TRANSACTION_LOG_202006271421.del
MON_GET_APPL_LOCKWAIT_202006271429.del           MON_GET_TRANSACTION_LOG_202006271422.del
MON_GET_CONNECTION_202006271421.del              MON_GET_TRANSACTION_LOG_202006271423.del
MON_GET_CONNECTION_202006271424.del              MON_GET_TRANSACTION_LOG_202006271424.del
MON_GET_CONNECTION_202006271427.del              MON_GET_TRANSACTION_LOG_202006271425.del
MON_GET_EXTENDED_LATCH_WAIT_202006271421.del     MON_GET_TRANSACTION_LOG_202006271426.del
MON_GET_EXTENDED_LATCH_WAIT_202006271422.del     MON_GET_TRANSACTION_LOG_202006271427.del
MON_GET_EXTENDED_LATCH_WAIT_202006271423.del     MON_GET_TRANSACTION_LOG_202006271428.del
MON_GET_EXTENDED_LATCH_WAIT_202006271424.del     MON_GET_TRANSACTION_LOG_202006271429.del
MON_GET_EXTENDED_LATCH_WAIT_202006271425.del     MON_GET_UNIT_OF_WORK_202006271422.del
MON_GET_EXTENDED_LATCH_WAIT_202006271426.del     MON_GET_UNIT_OF_WORK_202006271424.del
MON_GET_EXTENDED_LATCH_WAIT_202006271427.del     MON_GET_UNIT_OF_WORK_202006271426.del
MON_GET_EXTENDED_LATCH_WAIT_202006271428.del     MON_GET_UNIT_OF_WORK_202006271428.del
MON_GET_EXTENDED_LATCH_WAIT_202006271429.del     MON_GET_UTILITY_202006271421.del
MON_GET_EXTENT_MOVEMENT_STATUS_202006271421.del  MON_GET_UTILITY_202006271422.del
MON_GET_EXTENT_MOVEMENT_STATUS_202006271422.del  MON_GET_UTILITY_202006271423.del
MON_GET_EXTENT_MOVEMENT_STATUS_202006271423.del  MON_GET_UTILITY_202006271424.del
MON_GET_EXTENT_MOVEMENT_STATUS_202006271424.del  MON_GET_UTILITY_202006271425.del
MON_GET_EXTENT_MOVEMENT_STATUS_202006271425.del  MON_GET_UTILITY_202006271426.del
MON_GET_EXTENT_MOVEMENT_STATUS_202006271426.del  MON_GET_UTILITY_202006271427.del
MON_GET_EXTENT_MOVEMENT_STATUS_202006271427.del  MON_GET_UTILITY_202006271428.del
MON_GET_EXTENT_MOVEMENT_STATUS_202006271428.del  MON_GET_UTILITY_202006271429.del
MON_GET_EXTENT_MOVEMENT_STATUS_202006271429.del  MON_GET_WORKLOAD_202006271425.del
MON_GET_INDEX_202006271425.del                   NETSTAT_202006271421.del
MON_GET_LATCH_202006271421.del                   NETSTAT_202006271422.del
MON_GET_LATCH_202006271422.del                   NETSTAT_202006271423.del
MON_GET_LATCH_202006271423.del                   NETSTAT_202006271424.del
MON_GET_LATCH_202006271424.del                   NETSTAT_202006271425.del
MON_GET_LATCH_202006271425.del                   NETSTAT_202006271426.del
MON_GET_LATCH_202006271426.del                   NETSTAT_202006271427.del
MON_GET_LATCH_202006271427.del                   NETSTAT_202006271428.del
MON_GET_LATCH_202006271428.del                   NETSTAT_202006271429.del
MON_GET_LATCH_202006271429.del                   task_details_copy.json
MON_GET_MEMORY_POOL_202006271421.del             VMSTAT_202006271421.del
MON_GET_MEMORY_POOL_202006271422.del             VMSTAT_202006271422.del
MON_GET_MEMORY_POOL_202006271423.del             VMSTAT_202006271423.del
MON_GET_MEMORY_POOL_202006271424.del             VMSTAT_202006271424.del
MON_GET_MEMORY_POOL_202006271425.del             VMSTAT_202006271425.del
MON_GET_MEMORY_POOL_202006271426.del             VMSTAT_202006271426.del
MON_GET_MEMORY_POOL_202006271427.del             VMSTAT_202006271427.del
MON_GET_MEMORY_POOL_202006271428.del             VMSTAT_202006271428.del
MON_GET_MEMORY_POOL_202006271429.del             VMSTAT_202006271429.del
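Each .del file is a comma-delimited export, so it can be read with any CSV reader; for SQL tasks the first column is COLLECTION_TIME (as defined by the collection_command). The rows below are fabricated sample data for illustration only:

```python
import csv
import io

# Fabricated sample rows in the same comma-delimited (.del) shape; real
# files contain the columns listed in the HEADER column of IBMHIST.TAB_TASKS.
SAMPLE_DEL = (
    '"2020-06-27-14.21.00.000000",0,"Linux","hotellnx112"\n'
    '"2020-06-27-14.21.00.000000",1,"Linux","hotellnx112"\n'
)

rows = list(csv.reader(io.StringIO(SAMPLE_DEL)))
print(len(rows), rows[0][0])
# 2 2020-06-27-14.21.00.000000
```

(The 2_loader script, covered in a subsequent blog, automates loading these files into database tables.)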


After a few hours, I can see that each hour's data collection folder has been tar+gzipped and moved to the archive folder:

(db2inst1@hotellnx112) /home/db2inst1/sqllib/db2dump/IBMHIST_TESTDB1_archive
$ ls
 
TESTDB1_2020062716.tar.gz
TESTDB1_2020062717.tar.gz
TESTDB1_2020062718.tar.gz
TESTDB1_2020062719.tar.gz
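The retention behavior described for --max_size can be modeled with a short sketch (illustrative only; the real logic lives in the framework's PROC_ARCHIVE procedure): when the total archive size exceeds the cap, the oldest hourly archives are deleted first.

```python
def prune_archives(archives, max_size):
    """archives: list of (filename, size_in_bytes); the filenames sort
    chronologically thanks to the embedded timestamp. Returns the
    filenames that would be deleted to get under max_size."""
    keep = sorted(archives)          # oldest first, by timestamp in name
    total = sum(size for _, size in keep)
    deleted = []
    while keep and total > max_size:
        name, size = keep.pop(0)     # drop the oldest archive
        total -= size
        deleted.append(name)
    return deleted

archives = [("TESTDB1_2020062716.tar.gz", 400),
            ("TESTDB1_2020062717.tar.gz", 300),
            ("TESTDB1_2020062718.tar.gz", 500)]
print(prune_archives(archives, max_size=900))
# ['TESTDB1_2020062716.tar.gz']
```

With a 900-byte cap on 1200 bytes of archives, only the oldest hourly archive needs to go; a tighter cap would remove more, always oldest-first.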



The IBMHIST Control Tables

As mentioned earlier, the historical monitoring framework uses a set of control tables in order to operate. We can see the tables here:

(db2inst1@hotellnx112) /home/db2inst1/db2histmon-master/1_setup/
$ db2 connect to testdb1
$ db2 list tables for schema IBMHIST

Table/View                      Schema          Type  Creation time
------------------------------- --------------- ----- --------------------------
TAB_CONFIG                      IBMHIST         T     2020-06-27-14.12.07.943550
TAB_DIRS                        IBMHIST         T     2020-06-27-14.12.08.907709
TAB_ERRS                        IBMHIST         T     2020-06-27-14.12.08.267541
TAB_TASKS                       IBMHIST         T     2020-06-27-14.12.08.521290


  • The IBMHIST.TAB_CONFIG table contains information about the configuration parameters/arguments that were passed into the setup.py script (or the default value, if they were not specified).

(db2inst1@hotellnx112) /home/db2inst1/
$ db2 "select CONFIG_NAME, VALUE, DESCRIPTION from IBMHIST.TAB_CONFIG"

CONFIG_NAME          VALUE                                    DESCRIPTION
-------------------- ---------------------------------------- ----------------------------------------
COLL_PATH            /home/db2inst1/sqllib/db2dump            DIRECTORY PATH OF DATA COLLECTION
ARCH_PATH            /home/db2inst1/sqllib/db2dump            DIRECTORY PATH OF DATA ARCHIVAL
MAX_SIZE             1073741824                               MAX SIZE OF COLLECTION AND ARCHIVAL
ARCH_CMD             tar -caf _dest_ _src_                    COMMAND USED TO ARCHIVE HOURLY DIRECTORI
ARCH_EXT             .tar.gz                                  EXTENSION OF ARCHIVE HOURLY DIRECTORIES
TASK_DETAILS_PATH    /home/hotellnx112/db2inst1/db2histmon-ma LOCATION OF task_details.json FILE

  6 record(s) selected.


  • The IBMHIST.TAB_DIRS table contains information about the current (and any previously specified) data collection paths (coll_path), and a history of each data archive path (arch_path), including the size of the archived file.

For example, we can see the current hourly data collection path like so:

(db2inst1@hotellnx112) /home/db2inst1/
$ db2 "select PATH, SIZE, STATUS, TIME from IBMHIST.TAB_DIRS where STATUS='COLL'"

PATH                                                                   SIZE                 STATUS TIME      
---------------------------------------------------------------------- -------------------- ------ --------------------------
/home/db2inst1/sqllib/db2dump/IBMHIST_TESTDB1/TESTDB1_2020062910                    5133004 COLL   2020-06-29-10.43.00.000000

  1 record(s) selected.


 Or we can see a list of all previously archived hourly folders like so:

(db2inst1@hotellnx112) /home/db2inst1/
$ db2 "select PATH, SIZE, STATUS, TIME from IBMHIST.TAB_DIRS where STATUS='ARCH' order by TIME asc"

PATH                                                                   SIZE                 STATUS TIME      
---------------------------------------------------------------------- -------------------- ------ --------------------------
/home/db2inst1/sqllib/db2dump/IBMHIST_TESTDB1_archive/TESTDB1_20200627               328432 ARCH   2020-06-27-16.59.00.000000
/home/db2inst1/sqllib/db2dump/IBMHIST_TESTDB1_archive/TESTDB1_20200627              1901572 ARCH   2020-06-27-17.59.00.000000
/home/db2inst1/sqllib/db2dump/IBMHIST_TESTDB1_archive/TESTDB1_20200627              1923296 ARCH   2020-06-27-18.59.00.000000
...etc...

  15 record(s) selected.

  • The IBMHIST.TAB_TASKS table contains information about each data collection task, as defined in the task_details.json file when setup.py was last fully run (or the tasks were refreshed).
For example, we can list each active data collection task, including its operating system command or SQL query:

(db2inst1@hotellnx112) /home/db2inst1/sqllib/
$ db2 "select COLL_NAME, CLASS, COMMAND from IBMHIST.TAB_TASKS"

COLL_NAME            CLASS COMMAND
-------------------- ----- ----------------------------------------------------------------------
ENV_GET_SYSTEM_RESOU SQL   SELECT CURRENT TIMESTAMP AS COLLECTION_TIME, T.* FROM TABLE ( SYSPROC.
MON_GET_AGENT        SQL   SELECT CURRENT TIMESTAMP AS COLLECTION_TIME, T.* FROM TABLE ( MON_GET_
MON_GET_CONNECTION   SQL   SELECT CURRENT TIMESTAMP AS COLLECTION_TIME, T.* FROM TABLE ( MON_GET_
MON_GET_ACTIVITY     SQL   SELECT CURRENT TIMESTAMP AS COLLECTION_TIME, T.* FROM TABLE ( MON_GET_
MON_GET_UNIT_OF_WORK SQL   SELECT CURRENT TIMESTAMP AS COLLECTION_TIME, T.* FROM TABLE ( MON_GET_
MON_CURRENT_SQL      SQL   SELECT CURRENT TIMESTAMP AS COLLECTION_TIME, T.* FROM SYSIBMADM.MON_CU
MON_GET_APPL_LOCKWAI SQL   SELECT CURRENT TIMESTAMP AS COLLECTION_TIME, T.* FROM TABLE( MON_GET_A
MON_GET_EXTENDED_LAT SQL   SELECT CURRENT TIMESTAMP AS COLLECTION_TIME, T.* FROM TABLE ( MON_GET_
MON_GET_MEMORY_POOL  SQL   SELECT CURRENT TIMESTAMP AS COLLECTION_TIME, T.* FROM TABLE (MON_GET_M
MON_GET_MEMORY_SET   SQL   SELECT CURRENT TIMESTAMP AS COLLECTION_TIME, T.* FROM TABLE ( MON_GET_
MON_GET_TRANSACTION_ SQL   SELECT CURRENT TIMESTAMP AS COLLECTION_TIME, T.* FROM TABLE ( MON_GET_
DB_GET_CFG           SQL   SELECT CURRENT TIMESTAMP AS COLLECTION_TIME, T.* FROM TABLE ( DB_GET_C
DBMCFG               SQL   SELECT CURRENT TIMESTAMP AS COLLECTION_TIME, T.* FROM SYSIBMADM.DBMCFG
ENV_INST_INFO        SQL   SELECT CURRENT TIMESTAMP AS COLLECTION_TIME, T.* FROM SYSIBMADM.ENV_IN
ENV_GET_REG_VARIABLE SQL   SELECT CURRENT TIMESTAMP AS COLLECTION_TIME, T.* FROM TABLE ( ENV_GET_
MON_GET_EXTENT_MOVEM SQL   SELECT CURRENT TIMESTAMP AS COLLECTION_TIME, T.* FROM TABLE ( MON_GET_
MON_GET_GROUP_BUFFER SQL   SELECT CURRENT TIMESTAMP AS COLLECTION_TIME, T.* FROM TABLE ( MON_GET_
MON_GET_INDEX        SQL   SELECT CURRENT TIMESTAMP AS COLLECTION_TIME, T.* FROM TABLE ( MON_GET_
MON_GET_LATCH        SQL   SELECT CURRENT TIMESTAMP AS COLLECTION_TIME, T.* FROM TABLE ( MON_GET_
MON_GET_BUFFERPOOL   SQL   SELECT CURRENT TIMESTAMP AS COLLECTION_TIME, T.* FROM TABLE ( MON_GET_
MON_GET_PAGE_ACCESS_ SQL   SELECT CURRENT TIMESTAMP AS COLLECTION_TIME, T.* FROM TABLE ( MON_GET_
MON_GET_PKG_CACHE_ST SQL   SELECT CURRENT TIMESTAMP AS COLLECTION_TIME, T.* FROM TABLE ( MON_GET_
MON_GET_REBALANCE_ST SQL   SELECT CURRENT TIMESTAMP AS COLLECTION_TIME, T.* FROM TABLE ( MON_GET_
MON_GET_SERVERLIST   SQL   SELECT CURRENT TIMESTAMP AS COLLECTION_TIME, T.* FROM TABLE ( MON_GET_
MON_GET_TABLE        SQL   SELECT CURRENT TIMESTAMP AS COLLECTION_TIME, T.* FROM TABLE ( MON_GET_
MON_GET_TABLESPACE   SQL   SELECT CURRENT TIMESTAMP AS COLLECTION_TIME, T.* FROM TABLE ( MON_GET_
MON_GET_UTILITY      SQL   SELECT CURRENT TIMESTAMP AS COLLECTION_TIME, T.* FROM TABLE ( MON_GET_
MON_GET_WORKLOAD     SQL   SELECT CURRENT TIMESTAMP AS COLLECTION_TIME, T.* FROM TABLE ( MON_GET_
VMSTAT               SYS   vmstat
IOSTAT               SYS   iostat
NETSTAT              SYS   netstat

For SQL class tasks, we can also list all the column names of the generated data by examining the HEADER column:

(db2inst1@hotellnx112) /home/db2inst1/
$ db2 "select COLL_NAME,  HEADER as COLUMN_NAMES from IBMHIST.TAB_TASKS where CLASS='SQL'"

COLL_NAME            COLUMN_NAMES
-------------------- ----------------------------------------------------------------------
ENV_GET_SYSTEM_RESOU COLLECTION_TIME,MEMBER,OS_NAME,HOST_NAME,OS_VERSION,OS_RELEASE,MACHINE
MON_GET_AGENT        COLLECTION_TIME,SERVICE_SUPERCLASS_NAME,SERVICE_SUBCLASS_NAME,APPLICAT
MON_GET_CONNECTION   COLLECTION_TIME,APPLICATION_HANDLE,APPLICATION_NAME,APPLICATION_ID,MEM
MON_GET_ACTIVITY     COLLECTION_TIME,APPLICATION_HANDLE,DBPARTITIONNUM,COORD_PARTITION_NUM,
MON_GET_UNIT_OF_WORK COLLECTION_TIME,SERVICE_SUPERCLASS_NAME,SERVICE_SUBCLASS_NAME,SERVICE_
MON_CURRENT_SQL      COLLECTION_TIME,COORD_MEMBER,APPLICATION_HANDLE,APPLICATION_NAME,SESSI
MON_GET_APPL_LOCKWAI COLLECTION_TIME,LOCK_WAIT_START_TIME,LOCK_NAME,LOCK_OBJECT_TYPE_ID,LOC
MON_GET_EXTENDED_LAT COLLECTION_TIME,MEMBER,LATCH_NAME,TOTAL_EXTENDED_LATCH_WAITS,TOTAL_EXT
MON_GET_MEMORY_POOL  COLLECTION_TIME,MEMBER,HOST_NAME,DB_NAME,MEMORY_SET_TYPE,MEMORY_POOL_T
MON_GET_MEMORY_SET   COLLECTION_TIME,MEMBER,HOST_NAME,DB_NAME,MEMORY_SET_TYPE,MEMORY_SET_ID
MON_GET_TRANSACTION_ COLLECTION_TIME,MEMBER,TOTAL_LOG_AVAILABLE,TOTAL_LOG_USED,SEC_LOG_USED
DB_GET_CFG           COLLECTION_TIME,NAME,VALUE,VALUE_FLAGS,DEFERRED_VALUE,DEFERRED_VALUE_F
DBMCFG               COLLECTION_TIME,NAME,VALUE,VALUE_FLAGS,DEFERRED_VALUE,DEFERRED_VALUE_F
ENV_INST_INFO        COLLECTION_TIME,INST_NAME,IS_INST_PARTITIONABLE,NUM_DBPARTITIONS,INST_
ENV_GET_REG_VARIABLE COLLECTION_TIME,MEMBER,REG_VAR_NAME,REG_VAR_VALUE,REG_VAR_ON_DISK_VALU
MON_GET_EXTENT_MOVEM COLLECTION_TIME,TBSP_NAME,TBSP_ID,MEMBER,CURRENT_EXTENT,LAST_EXTENT,NU
MON_GET_GROUP_BUFFER COLLECTION_TIME,MEMBER,NUM_GBP_FULL
MON_GET_INDEX        COLLECTION_TIME,TABSCHEMA,TABNAME,IID,MEMBER,DATA_PARTITION_ID,NLEAF,N
MON_GET_LATCH        COLLECTION_TIME,LATCH_NAME,MEMORY_ADDRESS,EDU_ID,EDU_NAME,APPLICATION_
MON_GET_BUFFERPOOL   COLLECTION_TIME,BP_NAME,MEMBER,AUTOMATIC,DIRECT_READS,DIRECT_READ_REQS
MON_GET_PAGE_ACCESS_ COLLECTION_TIME,TABSCHEMA,TABNAME,OBJTYPE,MEMBER,PAGE_RECLAIMS_X,PAGE_
MON_GET_PKG_CACHE_ST COLLECTION_TIME,MEMBER,SECTION_TYPE,INSERT_TIMESTAMP,EXECUTABLE_ID,PAC
MON_GET_REBALANCE_ST COLLECTION_TIME,TBSP_NAME,TBSP_ID,DBPARTITIONNUM,MEMBER,REBALANCER_MOD
MON_GET_SERVERLIST   COLLECTION_TIME,MEMBER,CACHED_TIMESTAMP,HOSTNAME,PORT_NUMBER,SSL_PORT_
MON_GET_TABLE        COLLECTION_TIME,TABSCHEMA,TABNAME,MEMBER,TAB_TYPE,TAB_FILE_ID,DATA_PAR
MON_GET_TABLESPACE   COLLECTION_TIME,TBSP_NAME,TBSP_ID,MEMBER,TBSP_TYPE,TBSP_CONTENT_TYPE,T
MON_GET_UTILITY      COLLECTION_TIME,MEMBER,COORD_MEMBER,APPLICATION_HANDLE,APPLICATION_ID,
MON_GET_WORKLOAD     COLLECTION_TIME,WORKLOAD_NAME,WORKLOAD_ID,MEMBER,ACT_ABORTED_TOTAL,ACT

  28 record(s) selected.

  • The IBMHIST.TAB_ERRS table contains a history of any execution errors encountered by the historical monitoring framework:
(db2inst1@hotellnx112) /home/db2inst1/
$ db2 "select TIME, CALLER, COLL_NAME, ERRMSG from IBMHIST.TAB_ERRS"

TIME                       CALLER               COLL_NAME            ERRMSG                                       
-------------------------- -------------------- -------------------- ------------------------------------------------------------
2020-06-27-14.51.29.200058 PROC_COLLECT         DUMMY_SQL_TASK       Failed to write to data collection path: /home/db2inst1/MYHI...etc...

   1 record(s) selected.
 
 

Example of modifying a setup parameter (changing the data collection path):


First, we create a new folder to host the active data collection:
 
(db2inst1@hotellnx112) /home/db2inst1/
$ mkdir /home/db2inst1/MYHISTMON_COLLECTION_TESTDB1

Since the data collections are performed by external UDFs running under a fenced userid (not the db2-instance-owner userid), the new folder must be accessible to that fenced userid. For simplicity, I'll open the folder permissions to everyone:
 
(db2inst1@hotellnx112) /home/db2inst1/
$ chmod 777 MYHISTMON_COLLECTION_TESTDB1
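Rather than a world-writable 777, a tighter alternative is owner-plus-group access, with the directory's group set to one the fenced userid belongs to. A minimal Python sketch of the idea (the group name in the comment is hypothetical; check your instance's fenced userid configuration):

```python
import os
import stat
import tempfile

def make_collection_dir(path, mode=0o770):
    """Create the collection directory with owner+group access only,
    instead of world-writable 777."""
    os.makedirs(path, exist_ok=True)
    os.chmod(path, mode)
    # In practice, also set the directory's group to one the fenced
    # userid belongs to, e.g. (group name is hypothetical):
    #   shutil.chown(path, group="db2fadm1")
    return stat.S_IMODE(os.stat(path).st_mode)

d = os.path.join(tempfile.mkdtemp(), "MYHISTMON_COLLECTION_TESTDB1")
print(oct(make_collection_dir(d)))  # 0o770
```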

Next, I execute the setup script, specifying the --coll_path option with the new path. I also specify the --update_config_only option, since I did not modify the task_details.json file and do not need to update the data collection tasks.

(db2inst1@hotellnx112) /home/db2inst1/db2histmon-master/1_setup/
$ python3 setup.py testdb1 --coll_path /home/db2inst1/MYHISTMON_COLLECTION_TESTDB1/ --update_config_only
 
Connecting to database: TESTDB1
Configuring IBMHIST settings ...
    Deleting configurations from IBMHIST.TAB_CONFIG ...
    Setting COLL_PATH to '/home/db2inst1/MYHISTMON_COLLECTION_TESTDB1' ...
    Setting ARCH_PATH to '/home/db2inst1/sqllib/db2dump' ...
    Setting MAX_SIZE to '1073741824' ...
    Setting ARCH_CMD to 'tar -caf _dest_ _src_' and ARCH_EXT to '.tar.gz' ...
    Setting TASK_DETAILS_PATH to '/home/hotellnx112/db2inst1/db2histmon-master/1_setup/task_details.json' ...
Closing connection ...
Done

All subsequent data collection tasks will now write their output into the new data collection path.
(Note that existing data in the old collection path will be automatically moved into the new path).

(db2inst1@hotellnx112) /home/db2inst1/MYHISTMON_COLLECTION_TESTDB1/IBMHIST_TESTDB1/TESTDB1_2020062913
$ ls

IOSTAT_202006291304.del
IOSTAT_202006291305.del
IOSTAT_202006291306.del
IOSTAT_202006291307.del
IOSTAT_202006291308.del
MON_GET_EXTENT_MOVEMENT_STATUS_202006291304.del
MON_GET_EXTENT_MOVEMENT_STATUS_202006291305.del
MON_GET_EXTENT_MOVEMENT_STATUS_202006291306.del
MON_GET_EXTENT_MOVEMENT_STATUS_202006291307.del
...etc...
...etc...

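The data files in the collection folder follow a TASK_YYYYMMDDHHMM.del naming pattern, which makes them easy to inventory programmatically. A small sketch, assuming that naming convention holds for all tasks:

```python
import re
from datetime import datetime

NAME_RE = re.compile(r"^(?P<task>.+)_(?P<ts>\d{12})\.del$")

def parse_collection_file(fname):
    """Split a db2histmon data file name like 'IOSTAT_202006291304.del'
    into (task name, collection timestamp); return None if it doesn't match."""
    m = NAME_RE.match(fname)
    if not m:
        return None
    return m.group("task"), datetime.strptime(m.group("ts"), "%Y%m%d%H%M")

print(parse_collection_file("MON_GET_EXTENT_MOVEMENT_STATUS_202006291304.del"))
# ('MON_GET_EXTENT_MOVEMENT_STATUS', datetime.datetime(2020, 6, 29, 13, 4))
```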

Example of Adding a new data collection task (a new operating system metric):

In this example, I want to add a new task to collect "ps -elf" output from the operating system, every minute.

First, we edit the task_details.json file and append a new entry for a task called 'PS_ELF', like so:
(Note: I could leave the collection_condition field blank, since I do not intend to use this task_details.json file in a Windows environment, so always executing this data collection task would be fine. For completeness, though, I'll set the collection_condition to UNIX, since 'ps -elf' makes no sense on Windows.)

<db2inst1@hotellnx112> /home/db2inst1/db2histmon-master/1_setup/
$ vim task_details.json

    ...etc...
    ...etc...
    ,
    {
        "collection_name": "PS_ELF",
        "collection_class": "SYS",
        "collection_command": "ps -elf",
        "collection_freq": "* * * * *",
        "collection_level": "1",
        "collection_condition": "UNIX",
        "loader_join_columns": "",
        "loader_diff_exempt_columns": "",
        "quickparse_summary_columns": ""
    }
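Appending the entry by hand works fine; the same edit can also be scripted. A minimal sketch, assuming task_details.json is a plain JSON array of task objects (the demo runs against a scratch copy, not the real file):

```python
import json
import os
import tempfile

def add_task(task_details_path, task):
    """Append a task entry to task_details.json (assumption: the file's
    top level is a JSON array of task objects)."""
    with open(task_details_path) as f:
        tasks = json.load(f)
    if any(t.get("collection_name") == task["collection_name"] for t in tasks):
        raise ValueError("task already defined: %s" % task["collection_name"])
    tasks.append(task)
    with open(task_details_path, "w") as f:
        json.dump(tasks, f, indent=4)

# Demo against a scratch copy rather than the real file:
path = os.path.join(tempfile.mkdtemp(), "task_details.json")
with open(path, "w") as f:
    json.dump([{"collection_name": "IOSTAT"}], f)

add_task(path, {
    "collection_name": "PS_ELF",
    "collection_class": "SYS",
    "collection_command": "ps -elf",
    "collection_freq": "* * * * *",
    "collection_level": "1",
    "collection_condition": "UNIX",
    "loader_join_columns": "",
    "loader_diff_exempt_columns": "",
    "quickparse_summary_columns": "",
})
with open(path) as f:
    print([t["collection_name"] for t in json.load(f)])  # ['IOSTAT', 'PS_ELF']
```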

Next, I execute the setup.py script. Since I'm only adding a new task, I'll use the --update_tasks_only option. In the output we can see the new PS_ELF task being scheduled:

<db2inst1@hotellnx112> /home/db2inst1/db2histmon-master/1_setup/
$ python3 setup.py testdb1 --update_tasks_only

Connecting to database: TESTDB1
Determining environment ...
    Determining if Windows or Unix ...
        Unix detected ...
    Determining if pureScale ...
        pureScale not detected ...
    Determining if HADR ...
        HADR not detected ...
Unscheduling tasks ...
Scheduling tasks ...
    Setting Admin Task Scheduler registry variable ...
    Reading from task_details.json file ...
    Scheduling task: ENV_GET_SYSTEM_RESOURCES
    Scheduling task: MON_GET_AGENT
    Scheduling task: MON_GET_CONNECTION
    Scheduling task: MON_GET_ACTIVITY
    Scheduling task: MON_GET_UNIT_OF_WORK
    Scheduling task: MON_CURRENT_SQL
    Scheduling task: MON_GET_APPL_LOCKWAIT
    Scheduling task: MON_GET_EXTENDED_LATCH_WAIT
    Scheduling task: MON_GET_MEMORY_POOL
    Scheduling task: MON_GET_MEMORY_SET
    Scheduling task: MON_GET_TRANSACTION_LOG
    Scheduling task: DB_GET_CFG
    Scheduling task: DBMCFG
    Scheduling task: ENV_INST_INFO
    Scheduling task: ENV_GET_REG_VARIABLES
    Scheduling task: MON_GET_EXTENT_MOVEMENT_STATUS
    Scheduling task: MON_GET_GROUP_BUFFERPOOL
    Scheduling task: MON_GET_INDEX
    Scheduling task: MON_GET_LATCH
    Scheduling task: MON_GET_BUFFERPOOL
    Scheduling task: MON_GET_PAGE_ACCESS_INFO
    Scheduling task: MON_GET_PKG_CACHE_STMT
    Scheduling task: MON_GET_REBALANCE_STATUS
    Scheduling task: MON_GET_SERVERLIST
    Scheduling task: MON_GET_TABLE
    Scheduling task: MON_GET_TABLESPACE
    Scheduling task: MON_GET_UTILITY
    Scheduling task: MON_GET_WORKLOAD
    Scheduling task: VMSTAT
    Scheduling task: IOSTAT
    Scheduling task: NETSTAT
    Scheduling task: PS_ELF
    Scheduling task: ARCHIVE
    Collection will begin in approximately 5 minutes ... 
Closing connection ...
Done

 

If we examine the IBMHIST.TAB_TASKS table, we can also see the new PS_ELF data collection task:

<db2inst1@hotellnx112> /home/db2inst1/db2histmon-master/1_setup/
$ db2 "select COLL_NAME from IBMHIST.TAB_TASKS where COLL_NAME='PS_ELF'"

COLL_NAME                                                            
----------------------------------------------------------------
PS_ELF                                                          

  1 record(s) selected.

If we examine the SYSTOOLS.ADMIN_TASK_LIST table, we can see that the PS_ELF task has been added to the admin task scheduler:

<db2inst1@hotellnx112> /home/db2inst1/ 
$ db2 "SELECT NAME, PROCEDURE_NAME, REMARKS from SYSTOOLS.ADMIN_TASK_LIST"
 
NAME                           PROCEDURE_NAME       REMARKS
------------------------------ -------------------- ------------------------------
MON_GET_LATCH                  PROC_COLLECT         IBMHIST Collection
MON_GET_BUFFERPOOL             PROC_COLLECT         IBMHIST Collection
MON_GET_PAGE_ACCESS_INFO       PROC_COLLECT         IBMHIST Collection
MON_GET_PKG_CACHE_STMT         PROC_COLLECT         IBMHIST Collection
MON_GET_REBALANCE_STATUS       PROC_COLLECT         IBMHIST Collection
MON_GET_SERVERLIST             PROC_COLLECT         IBMHIST Collection
MON_GET_TABLE                  PROC_COLLECT         IBMHIST Collection
MON_GET_TABLESPACE             PROC_COLLECT         IBMHIST Collection
MON_GET_UTILITY                PROC_COLLECT         IBMHIST Collection
MON_GET_WORKLOAD               PROC_COLLECT         IBMHIST Collection
VMSTAT                         PROC_COLLECT         IBMHIST Collection
IOSTAT                         PROC_COLLECT         IBMHIST Collection
NETSTAT                        PROC_COLLECT         IBMHIST Collection
ARCHIVE                        PROC_ARCHIVE         IBMHIST Archival
PS_ELF                         PROC_COLLECT         IBMHIST Collection
MON_GET_FCM                    PROC_COLLECT         IBMHIST Collection
MON_GET_AGENT                  PROC_COLLECT         IBMHIST Collection
MON_GET_CONNECTION             PROC_COLLECT         IBMHIST Collection
MON_GET_ACTIVITY               PROC_COLLECT         IBMHIST Collection
MON_GET_UNIT_OF_WORK           PROC_COLLECT         IBMHIST Collection
MON_CURRENT_SQL                PROC_COLLECT         IBMHIST Collection
MON_GET_APPL_LOCKWAIT          PROC_COLLECT         IBMHIST Collection
MON_GET_EXTENDED_LATCH_WAIT    PROC_COLLECT         IBMHIST Collection
MON_GET_MEMORY_POOL            PROC_COLLECT         IBMHIST Collection
MON_GET_MEMORY_SET             PROC_COLLECT         IBMHIST Collection
MON_GET_TRANSACTION_LOG        PROC_COLLECT         IBMHIST Collection
DB_GET_CFG                     PROC_COLLECT         IBMHIST Collection
DBMCFG                         PROC_COLLECT         IBMHIST Collection
ENV_INST_INFO                  PROC_COLLECT         IBMHIST Collection
ENV_GET_REG_VARIABLES          PROC_COLLECT         IBMHIST Collection
MON_GET_EXTENT_MOVEMENT_STATUS PROC_COLLECT         IBMHIST Collection
MON_GET_GROUP_BUFFERPOOL       PROC_COLLECT         IBMHIST Collection
MON_GET_INDEX                  PROC_COLLECT         IBMHIST Collection
ENV_GET_SYSTEM_RESOURCES       PROC_COLLECT         IBMHIST Collection

  34 record(s) selected.



Lastly, if we wait approximately 5 minutes for the Db2 Admin Task Scheduler to process the new task, we will begin to see the output of the new PS_ELF task within the hourly data collection folder:

<db2inst1@hotellnx112> /home/db2inst1/sqllib/db2dump/IBMHIST_TESTDB1/TESTDB1_2020062717/
$ ls -l

DB_GET_CFG_202006271700.del
DBMCFG_202006271700.del
...etc... 
...etc... 
PS_ELF_202006271700.del
...etc...
...etc... 
 

Example of Adding a new data collection task (a new SQL query):

In this example, I want to add a new task to collect MON_GET_FCM() output every 10 minutes.

First, I'll edit the task_details.json file to add a new entry to the bottom for our new task called 'MON_GET_FCM', as so:

<db2inst1@hotellnx112> /home/db2inst1/db2histmon-master/1_setup/
$ vim task_details.json

    ...etc...
    ...etc...
    ,
    {
        "collection_name": "MON_GET_FCM",
        "collection_class": "SQL",
        "collection_command": "SELECT CURRENT TIMESTAMP AS COLLECTION_TIME, T.* FROM TABLE (MON_GET_FCM(-2)) T",
        "collection_freq": "10 * * * *",
        "collection_level": "1",
        "collection_condition": "",
        "loader_join_columns": "",
        "loader_diff_exempt_columns": "",
        "quickparse_summary_columns": ""
    }
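A note on the collection_freq field: it uses UNIX-cron-style syntax, and the minute field determines how often a task fires within each hour. This toy matcher (an illustration only, not the scheduler's implementation) shows the difference between a literal minute and a step value:

```python
def minute_matches(field, minute):
    """Tiny matcher for the minute field of a cron-style expression
    (supports '*', '*/n', and a literal minute; not full cron syntax)."""
    if field == "*":
        return True
    if field.startswith("*/"):
        return minute % int(field[2:]) == 0
    return minute == int(field)

# A literal "10" in the minute field fires once per hour, at minute 10:
print([m for m in range(60) if minute_matches("10", m)])    # [10]
# A step value "*/10" fires every 10 minutes:
print([m for m in range(60) if minute_matches("*/10", m)])  # [0, 10, 20, 30, 40, 50]
```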

Next, I'll invoke the setup.py script, again using the --update_tasks_only option. We can see the new MON_GET_FCM task added:
 
<db2inst1@hotellnx112> /home/db2inst1/db2histmon-master/1_setup/
$ python3 setup.py testdb1 --update_tasks_only
 
Connecting to database: TESTDB1
Determining environment ...
    Determining if Windows or Unix ...
        Unix detected ...
    Determining if pureScale ...
        pureScale not detected ...
    Determining if HADR ...
        HADR not detected ...
Unscheduling tasks ...
Scheduling tasks ...
    Setting Admin Task Scheduler registry variable ...
    Reading from task_details.json file ...
    Scheduling task: ENV_GET_SYSTEM_RESOURCES
    Scheduling task: MON_GET_AGENT
    Scheduling task: MON_GET_CONNECTION
    Scheduling task: MON_GET_ACTIVITY
    Scheduling task: MON_GET_UNIT_OF_WORK
    Scheduling task: MON_CURRENT_SQL
    Scheduling task: MON_GET_APPL_LOCKWAIT
    Scheduling task: MON_GET_EXTENDED_LATCH_WAIT
    Scheduling task: MON_GET_MEMORY_POOL
    Scheduling task: MON_GET_MEMORY_SET
    Scheduling task: MON_GET_TRANSACTION_LOG
    Scheduling task: DB_GET_CFG
    Scheduling task: DBMCFG
    Scheduling task: ENV_INST_INFO
    Scheduling task: ENV_GET_REG_VARIABLES
    Scheduling task: MON_GET_EXTENT_MOVEMENT_STATUS
    Scheduling task: MON_GET_GROUP_BUFFERPOOL
    Scheduling task: MON_GET_INDEX
    Scheduling task: MON_GET_LATCH
    Scheduling task: MON_GET_BUFFERPOOL
    Scheduling task: MON_GET_PAGE_ACCESS_INFO
    Scheduling task: MON_GET_PKG_CACHE_STMT
    Scheduling task: MON_GET_REBALANCE_STATUS
    Scheduling task: MON_GET_SERVERLIST
    Scheduling task: MON_GET_TABLE
    Scheduling task: MON_GET_TABLESPACE
    Scheduling task: MON_GET_UTILITY
    Scheduling task: MON_GET_WORKLOAD
    Scheduling task: VMSTAT
    Scheduling task: IOSTAT
    Scheduling task: NETSTAT
    Scheduling task: PS_ELF
    Scheduling task: MON_GET_FCM
    Scheduling task: ARCHIVE
    Collection will begin in approximately 5 minutes ...
Closing connection ...
Done


We can see that our new task has been added to the IBMHIST.TAB_TASKS table:
 
<db2inst1@hotellnx112> /home/db2inst1/
$ db2 "select COLL_NAME from IBMHIST.TAB_TASKS where COLL_NAME='MON_GET_FCM'"

COLL_NAME
----------------------------------------------------------------
MON_GET_FCM

  1 record(s) selected.

Lastly, if we wait approximately 5 minutes for the Db2 Admin Task Scheduler to process the new task, we will begin to see the output of the new MON_GET_FCM task within the hourly data collection folder:

<db2inst1@hotellnx112> /home/db2inst1/sqllib/db2dump/IBMHIST_TESTDB1/TESTDB1_2020062914/
$ ls -l

DB_GET_CFG_202006291400.del 
DBMCFG_202006291400.del
...etc...  
...etc...  
MON_GET_FCM_202006291439.del
...etc...  
...etc...  

Example of Removing a data collection task:

The cleanest way to remove a data collection task is to edit the task_details.json file, remove the desired task entry, and then run the setup.py script again, using the --update_tasks_only option.

For example, if I wanted to remove the iostat data collection task, I would remove this entry from the task_details.json file:

<db2inst1@hotellnx112> /home/db2inst1/db2histmon-master/1_setup/
$ vim task_details.json

    ...etc...
    ...etc...
    {
        "collection_name": "IOSTAT",
        "collection_class": "SYS",
        "collection_command": "iostat",
        "collection_freq": "* * * * *",
        "collection_level": "1",
        "collection_condition": "UNIX",
        "loader_join_columns": "",
        "loader_diff_exempt_columns": "",
        "quickparse_summary_columns": ""
    },
    ...etc...
    ...etc...
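This removal can also be scripted. A minimal sketch, under the same assumption that task_details.json is a plain JSON array of task objects (the demo runs against a scratch copy, not the real file):

```python
import json
import os
import tempfile

def remove_task(task_details_path, name):
    """Remove a task entry from task_details.json by collection_name
    (assumption: the file's top level is a JSON array of task objects)."""
    with open(task_details_path) as f:
        tasks = json.load(f)
    kept = [t for t in tasks if t.get("collection_name") != name]
    if len(kept) == len(tasks):
        raise KeyError("no such task: %s" % name)
    with open(task_details_path, "w") as f:
        json.dump(kept, f, indent=4)

# Demo against a scratch copy rather than the real file:
path = os.path.join(tempfile.mkdtemp(), "task_details.json")
with open(path, "w") as f:
    json.dump([{"collection_name": "IOSTAT"}, {"collection_name": "VMSTAT"}], f)

remove_task(path, "IOSTAT")
with open(path) as f:
    print([t["collection_name"] for t in json.load(f)])  # ['VMSTAT']
```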

And then re-run the setup.py script using the --update_tasks_only option: 

<db2inst1@hotellnx112> /home/db2inst1/db2histmon-master/1_setup/
$ python3 setup.py testdb1 --update_tasks_only
 
Connecting to database: TESTDB1
Determining environment ...
    Determining if Windows or Unix ...
        Unix detected ...
    Determining if pureScale ...
        pureScale not detected ...
    Determining if HADR ...
        HADR not detected ...
Unscheduling tasks ...
Scheduling tasks ...
    Setting Admin Task Scheduler registry variable ...
    Reading from task_details.json file ...
    Scheduling task: ENV_GET_SYSTEM_RESOURCES
    Scheduling task: MON_GET_AGENT
    Scheduling task: MON_GET_CONNECTION
    Scheduling task: MON_GET_ACTIVITY
    Scheduling task: MON_GET_UNIT_OF_WORK
    Scheduling task: MON_CURRENT_SQL
    Scheduling task: MON_GET_APPL_LOCKWAIT
    Scheduling task: MON_GET_EXTENDED_LATCH_WAIT
    Scheduling task: MON_GET_MEMORY_POOL
    Scheduling task: MON_GET_MEMORY_SET
    Scheduling task: MON_GET_TRANSACTION_LOG
    Scheduling task: DB_GET_CFG
    Scheduling task: DBMCFG
    Scheduling task: ENV_INST_INFO
    Scheduling task: ENV_GET_REG_VARIABLES
    Scheduling task: MON_GET_EXTENT_MOVEMENT_STATUS
    Scheduling task: MON_GET_GROUP_BUFFERPOOL
    Scheduling task: MON_GET_INDEX
    Scheduling task: MON_GET_LATCH
    Scheduling task: MON_GET_BUFFERPOOL
    Scheduling task: MON_GET_PAGE_ACCESS_INFO
    Scheduling task: MON_GET_PKG_CACHE_STMT
    Scheduling task: MON_GET_REBALANCE_STATUS
    Scheduling task: MON_GET_SERVERLIST
    Scheduling task: MON_GET_TABLE
    Scheduling task: MON_GET_TABLESPACE
    Scheduling task: MON_GET_UTILITY
    Scheduling task: MON_GET_WORKLOAD
    Scheduling task: VMSTAT
    Scheduling task: NETSTAT
    Scheduling task: PS_ELF
    Scheduling task: MON_GET_FCM
    Scheduling task: ARCHIVE
    Collection will begin in approximately 5 minutes ...
Closing connection ...
Done


Example of Stopping all data collection tasks:

The easiest way to stop all data collection tasks is to simply disable and remove the historical monitoring framework from the database, using the setup.py --cleanup argument:

<db2inst1@hotellnx112> /home/db2inst1/db2histmon-master/1_setup/
$ python3 setup.py testdb1 --cleanup
 
Connecting to database: TESTDB1
Unscheduling tasks ...
Dropping IBMHIST schema and its objects ...
Closing connection ...
Done


You can validate that no data collection tasks remain scheduled within the Admin Task Scheduler by querying SYSTOOLS.ADMIN_TASK_LIST:

<db2inst1@hotellnx112> /home/db2inst1/db2histmon-master/1_setup/
$ db2 "SELECT NAME, PROCEDURE_NAME from SYSTOOLS.ADMIN_TASK_LIST"

NAME                    PROCEDURE_NAME                                                                           
----------------------- --------------------------------------------------------
  
 0 record(s) selected.




Stay tuned for new blogs expanding on the Db2 Historical Monitoring (db2histmon) suite, coming soon.


