Robert's Db2 blog: DB2 for z/OS: Want to use High-Performance DBATs? Check your MAXDBAT Value

Sunday, December 29, 2013

DB2 for z/OS: Want to use High-Performance DBATs? Check your MAXDBAT Value

Of the features introduced with DB2 10 for z/OS, high-performance DBATs is one of my favorites. It enabled (finally) DDF-using applications to get the CPU efficiency benefit that comes from combining thread reuse with the RELEASE(DEALLOCATE) package bind option -- a performance tuning action that has long been leveraged for CICS-DB2 workloads. Implementing high-performance DBATs is pretty easy: in a DB2 10 (or 11) environment, when a package bound with RELEASE(DEALLOCATE) is executed by way of a DBAT (i.e., a database access thread -- the kind used for DRDA requesters that connect to DB2 via the distributed data facility), that thread becomes a high-performance DBAT (if it isn't one already). Before jumping into this, however, you should consider some things that are impacted by the use of high-performance DBATs. One of those things is the DBAT pool. That's where the MAXDBAT parameter of ZPARM comes in, and that's what this blog entry is about.

The value of MAXDBAT determines the maximum number of DBATs that can be concurrently active for a DB2 subsystem. The default value is 200, and at many sites that value, or one that's a little larger, has effectively supported a much greater number of DB2 client-server application connections (the default value for CONDBAT in ZPARM -- the maximum number of connections through DDF to a DB2 subsystem -- is 10,000). How so? Well, if your system is set up to allow for inactive connections (CMTSTAT = INACTIVE has been the default in ZPARM since DB2 V8), when a DDF transaction completes the associated connection will go into an inactive state (a very low-overhead transition, as is the transition back to the active state) and the DBAT used for the transaction will go into the DBAT pool, ready to service another transaction. That can happen because a "regular" DBAT is only associated with a particular DB2 connection while it is being used to execute a request from said connection. Because it is common for only a small percentage of DDF connections to a DB2 subsystem to be active (i.e., associated with in-flight transactions) at any given moment, a large ratio of connections to DBATs has historically been no problem at all.

Bring high-performance DBATs into the picture, and things change. In particular, a high-performance DBAT, once instantiated, will remain dedicated to the connection through which it was instantiated until it's been reused by 200 units of work (at which point it will be terminated, so as to free up resources allocated to the thread). That high-performance DBAT, therefore, will NOT go into the DBAT pool when a transaction using the thread completes. When a request associated with another connection comes in (i.e., from a connection other than the one through which the high-performance DBAT was instantiated), the high-performance DBAT won't be available to service that request. Some other DBAT will have to be used, and guess what? If that DBAT isn't a high-performance DBAT, it will become one if the package associated with the incoming request (and that could be a DB2 Connect or IBM Data Server Driver package) was bound with RELEASE(DEALLOCATE). The DBAT pool thus becomes progressively smaller as high-performance DBATs are instantiated. Know what else happens? The number of active DBATs goes up -- maybe sharply. Why? Because a "regular" DBAT is active only while it is being used to execute a DDF transaction. A high-performance DBAT, on the other hand, is considered to be active as long as it exists -- that will be 200 units of work, as mentioned previously, and when a high-performance DBAT is waiting to be reused, it's an active DBAT.

This last point -- about the number of active DBATs potentially rising sharply when high-performance DBATs are utilized -- is illustrated by some information I recently received from a DB2 professional. At this person's shop, high-performance DBATs were "turned on" for a DB2 subsystem (the PKGREL option of the -MODIFY DDF command can be used as a "switch," telling DB2 to either honor RELEASE(DEALLOCATE) for packages executed via DBATs -- thereby enabling instantiation of high-performance DBATs -- or not), and the number of active DBATs for the subsystem went from the usual 60 or so to about 700. Because the MAXDBAT value for the DB2 subsystem was already at 750, these folks didn't run out of DBATs, but the pool of "regular" DBATs got pretty small. In response to the big increase in active DBATs seen when high-performance DBAT functionality was enabled, the MAXDBAT value for the DB2 system in question was increased to 2000. Was this OK? Yes: When packages are bound or rebound in a DB2 10 for z/OS environment, almost all thread-related virtual storage goes above the 2 GB "bar" in the DBM1 address space, and that allows for a 5- to 10-times increase in the number of threads that can be concurrently active for the DB2 subsystem.

So, if you're thinking about using high-performance DBATs (and you should), check your subsystem's MAXDBAT value, and consider making that value substantially larger than it is now. Additionally, take steps to enable selective use of high-performance DBATs by your network-attached, DB2-accessing applications. For programs that contain embedded SQL statements and, therefore, have their own packages (e.g., DB2 stored procedures -- both external and native), use RELEASE(DEALLOCATE) for the most frequently executed of these packages. For the packages associated with DB2 Connect and/or the IBM Data Server Driver, use two collections: The default NULLID collection, into which you'd bind the DB2 Connect and/or IBM Data Server Driver packages with RELEASE(COMMIT), and another collection (named as you want) into which you'd bind these packages with RELEASE(DEALLOCATE). Then, by way of a data source or connection string specification on the client side, direct DDF-using applications to NULLID or the other collection name, depending on whether or not you want high-performance DBATs to be used for a given application.

To keep an eye on DBAT usage for a DB2 subsystem, periodically issue the command -DISPLAY DDF DETAIL. In the output of that command you'll see a field, labeled QUEDBAT, that shows the number of times (since the DB2 subsystem was last started) that requests were delayed because the MAXDBAT limit had been reached. If the value of this field is non-zero, consider increasing MAXDBAT for the subsystem. You might also want to look at the value of the field DSCDBAT in the output of the -DISPLAY DDF DETAIL command. This value shows you the current number of DBATs in the pool for the subsystem. As I've pointed out, maintaining the "depth" of the DBAT pool as high-performance DBAT functionality is put to use might require increasing MAXDBAT for your DB2 subsystem.

DDF activity can also be tracked by way of your DB2 monitor. I particularly like to use a DB2 monitor-generated Statistics Long Report to see if the connection limit for a DB2 subsystem (specified via the CONDBAT parameter in ZPARM) is sufficiently high. In the section of the report under the heading "Global DDF Activity," I'll check the value of the field labeled CONN REJECTED-MAX CONNECTED (or something similar -- fields in reports generated by different DB2 monitors might be labeled somewhat differently). A non-zero value in this field is an indication that the CONDBAT limit has been hit, and in that case you'd probably want to set CONDBAT to a larger number to allow more connections to the DB2 subsystem.

So there you go. Using high-performance DBATs can improve the CPU efficiency of your DB2 for z/OS client-server workload, but if you do leverage high-performance DBAT functionality then you might need to boost the DBAT limit for your DB2 subsystem in order to maintain the depth of your DBAT pool, because as high-performance DBATs increase in number, pooled DBATs decrease in number (unless you've upped your MAXDBAT value to compensate for this effect). Boosting MAXDBAT in a DB2 10 (or 11) environment is OK, as thread-related virtual storage in such an environment is almost entirely above the 2 GB "bar" in the DBM1 address space (assuming that packages have been bound or rebound with DB2 at the Version 10 or 11 level). Of course, you need real storage to back virtual storage, so if you increase the MAXDBAT value keep an eye on the z/OS LPAR's demand paging rate and make sure that this doesn't get out of hand (if the demand paging rate is in the low single digits or less per second, it's not out of hand).

27 comments:

AnonymousDecember 30, 2013 at 2:38 PM
Robert, please elaborate on how it's possible that active DBATS "went from the usual 60 or so to about 700. Because the MAXDBAT value for the DB2 subsystem was already at 750 . . .".

High Performance DBATs are limited to half of MAXDBATs, so in this case 750/2 = 365, so I would expect worst case 365 + 60 = 425, so much less than 700.

Did they ignore the documented recommendations to restrict high performance DBATs to high volume, light transaction connections that disconnect when they don't have SQL work?

Did they ignore, by thread reuse and RELEASE(DEALLOCATE) analogy, the CICS-DB2 recommendations to limit the the use of CICS protected threads to high volume, light transactions?

But even if they did, the half of MAXDBAT cap should have limited the high performance DBATs to a maximum of 375.
ReplyDelete
Replies
AnonymousDecember 31, 2013 at 6:04 AM
Robert, I should have written 750/2 = 375, so I would expect worst case
375 + 60 = 435, so much less than 700.
ReplyDelete
Replies
AnonymousDecember 31, 2013 at 9:27 AM
Robert, thanks for your answers. The basis for my claim are some IBM presentations including page 41 of the March 13, 2012, SHARE 2012
Session 10996, "DB2 for z/OS Distributed Access – Best Practices and Updates" by Adrian Burke, which states for DB2 10:

"If # of Hi-Perf DBATs exceed 50% of MAXDBAT threshold

• DBATs will be pooled at commit and package resources copied/allocated as RELEASE(COMMIT)"

Which I interpreted, based on the assumption that transactions are committed, that MAXDBAT/2 was the effective cap. Hence my request for elaboration of your scenario. Have I misinterpreted the bullet? Is the actual implementation different?
ReplyDelete
Replies
AnonymousJanuary 2, 2014 at 8:26 AM
Robert, thank you for explaining about the presentation. Too bad that it wasn't implemented, I thought it was an excellent idea and safeguard. Do any other things like this come to mind (presented after GA but not implemented or backed out)?

I was also wrong earlier to write "documented recommendations" and wrong not to limit my comments to Connection Pooling, with an alternative being commit oriented Connection Concentrator.

Looking again at pages 6-10 of the the Adrian Burke presentation that mentioned previously, I see that I read in my own interpretation and recommendations into the text.

ReplyDelete
Replies
AnonymousJanuary 2, 2014 at 8:55 AM
Robert, in a recent presentation of yours you wrote:

"Best uses:
–Higher-volume transactions – especially those with lower SQL
statement execution cost (for these transactions, CPU cost of release
and reacquisition of resources at COMMIT is proportionately higher)
–For batch programs that issue a lot of commits"

Which applies for RELEASE(DEALLOCATE) usage in general.

"Have higher-volume client-server transactions use that second collection to gain high-performance DBAT performance benefits (collection name can be specified as a data source property on the client side)"

Which would work best for well-design and implemented Connection Pooling and Connection Concentrator configurations.

When you wrote:

"Because high-performance DBATs do not go back into the DBAT pool, you may want to increase value of MAXDBAT in ZPARM to compensate"

You use of "may want to" suggests that you weren't expecting active DBATs to increase from 60 to 700 in most places.
ReplyDelete
Replies
AnonymousFebruary 2, 2014 at 7:52 AM
Robert - in a CMTSTAT=INACTIVE setting, is there a periodic clean-up of the INACTIVE connections (i.e, if the app servers do not close out connections/Websphere settings are not robust enough to clean the connections in a timely manner)
ReplyDelete
Replies
AnonymousApril 27, 2016 at 7:54 AM
Hi Robert, I am running my DB2 Connect on the Linux system connecting to DB2 on mainframes. If I run a db2 batch program on linux, a new thread are created on mainframes after every commit point.

If I take ten commits the there are 10 threads on Linux but when i run the same on mainframes I get only single thread.

Please could you let me know how to run application programs under single thread on distributed systems??
ReplyDelete
Replies
AnonymousMay 10, 2016 at 2:39 AM
Hello Robert!

First, let me thank you for your detailed, explained and very understandable blog posts!

I read somewhere that the storage consumption of DBATs in the DBM1 address space is about ~200KB and that inactive connection storage consumption is about ~7.5KB in the DIST address space.

Can you verify this information? Can i assume it's correct?

Thanks!
Mark
ReplyDelete
Replies
SaiduluMay 25, 2021 at 1:21 PM
Hi Robert
I have a situation where CONDBAT=300 and INACONN reached upto 297 thus causing new connections to fail. we found that suddenly one of the application making large no of connections to Db2 . I want to understand how these inactive connections get terminated by DB2 ? or how to terminate them explicitly if needed ? or application should write logic to close the conenction properly to avoid this situation?
ReplyDelete
Replies

Add comment

Note: Only a member of this blog may post a comment.