Categorized | MySQL

XtraDB feature: save / restore buffer pool

Posted on 19 February 2010 by Abidoon

We recently released XtraDB-9, and while we did not highlight it in announcement, the release-making feature is ability to save and restore InnoDB buffer pool.
The idea is not new and was originally developed by Jeremy Cole (sorry, I do not have the link on hands) some time ago, and now we implemented it in XtraDB.

Why would we need to save and restore content of buffer pool ?
There are several reasons.
First, it’s not rate on modern servers to have 32GB+ of RAM, with allocated InnoDB buffer_pool 26GB or more. When you do restart of server, it may take long time to populate cache with useful data before you can bring it back to serve production load. It’s not rare to see
maintenance cycle takes two or more hours, mainly because the slave need to catchup with master and to warm cache.
In case with the server crash, it is even worse, you need to wait possible long time on InnoDB
recovery (we have the patch for that too, in that post you can see InnoDB recovery took 1h to accomplish) and after that warm caches.

Second, it is useful for some HA schemas, like DRBD, when, in case of failover, you need to start passive instance on cold.

So let’s see what results we have.
Details about patch you can get there http://www.percona.com/docs/wiki/percona-xtradb:patch:innodb_lru_dump_restore (Yasufumi names it LRU dump/restore, because he thinks about buffer pool as about LRU list, which how it is internally).

To save buffer pool you execute
select * from information_schema.XTRADB_ADMIN_COMMAND /*!XTRA_LRU_DUMP*/;

and to restore
select * from information_schema.XTRADB_ADMIN_COMMAND /*!XTRA_LRU_RESTORE*/;

it will create/read file ib_lru_dump from your database directory.

You may want to sort ib_lru_dump in order of pages in tablespaces, so RESTORE will be
performed in most sequential way. The small python script lrusort.py to sort ib_lru_dump is available
in our Launchpad branch lp:~percona-dev/percona-xtradb/extensions-1.0.6

I made small tpcc benchmark to show effect with restored buffer_pool (the condition of
benchmarks are the same as in my runs on fast storages, and I
used RAID10 to store InnoDB files).
First run (xtradb cold) I made just after restart and ran it for 1h.
After that I saved buffer_pool, restarted mysqld, restored buffer_pool ( it took about 4 min
to load 26GB worth of data), and run (xtradb warm) tpcc again.

Here is graphical results (results in New Transactions Per 10 sec, more is better):

tpcc_1000w

As you see in the cold run it took 1500-1800 sec to enter into stable mode, while
it warm run it happened almost from start. There was some period of unstable results, but it
did not affect ability to serve load.

You are welcome to test it, it is available in XtraDB-9 release and also in MariaDB 5.1.41-RC.


Entry posted by Vadim |
14 comments

Add to: delicious | digg | reddit | netscape | Google Bookmarks

View full post on MySQL Performance Blog

Tags | , , , , ,

10 Responses to “XtraDB feature: save / restore buffer pool”

  1. Patrick Mulvany says:

    Vadim, How is this patch effected by increases or decreases in InnoDB buffer pool size parameter?
    I would assume that changing this would not be a good idea.

  2. Vadim says:

    Tobias,

    Basically yes.
    You need though to sort dump of buffer_pool, so it will be loaded sequentially.
    Also dumping back to RAM may take some time (4 mins in my experiment), but it faster then work with cold cache.

  3. Tobias Petry says:

    Vadim, this means we can make a BufferPool-Dump, run the server 2 hours, crash it and after restarting (and innodb’s recovery) and dumping the BufferPool back to the RAM we have an warm InnoDB instance with no stale data? And no other problems?

  4. Vadim says:

    Baron,

    There is no reason why it will not work.

    I as said we store just pointers to pages (space_id, page_id),
    and at restore stage we just read pages by pointers.
    It does not matter if InnoDB crashed before, did recovery procedure, etc.

  5. Baron Schwartz says:

    Vadim, how does this interact with recovery? Does it work OK if you save the buffer pool contents, then crash the server, restart it, and restore the buffer pool? If not, then that might be a problem for the DRBD use case.

    I think it should work fine, but maybe I’m wrong.

  6. Vadim says:

    Kim,

    We store only pointers on data pages (space_id, page_id).
    There is no such problem as stalled data in this case.

  7. Vadim says:

    Mark,

    Agree, your suggestions are taken.

  8. Kim says:

    When you dump the bufferpool, what data is dumped? Is it just pointers to what rows etc that needs to be loaded or is it the actual data in the pool that is dumped.. .

    Im thinking if you restore an old pool will you risk getting invalid cached data or is the buffer pool re-validated against what is stored in the database when loaded.

  9. Mark R says:

    I’d like to see this made fully automatic – so it would automatically dump the LRU list periodically after a certain amount of uptime – say 24h and also at server shutdown if it had been running for long enough, and automatically load it on restart.

    These could be tunables of course.

    This would mean that most users could just forget about it and have cache warmup goodness happen by itself.

  10. Vadim says:

    Peter,

    Agree on 1) and 2), there is room for improvements.

    on 3)
    I guess warmup goes down by the same reason – started flushing activity, however worth to check it.
    We use adaptive checkpoint, I have no exact answer right now why there so uneven performance for 10 sec samples. Yasufumi suspects new mid-point insertion algorithm, but we need to research it.


Leave a Reply


 

February 2010
M T W T F S S
« Oct   Mar »
1234567
891011121314
15161718192021
22232425262728