
(Urgent) Stumbling along: greenplum FATAL: DTM initialization: failure during startup recovery, retry failed, check segment status (cdbtm.c:1603)

Question:

1. Checked the differences between the hosts with gpcheckos (screenshot omitted).
2. Checked the cluster's running status with gpstate (screenshot omitted).

The cluster can be started, but DTM initialization fails, so the database cannot be connected to.
The hostnames are aliases, which should not matter; the hosts do differ slightly in memory, but such a small difference should not cause this.
The firewall is disabled.
Any guidance would be appreciated -- this is probably the key to the problem: (cdbtm.c:1603)

postgres_up 2016-01-08 19:42:35
3 answers
  • 一枚PGer

    Found the answer: it turns out shared_buffers really does not need to be set that large. I am quoting a passage here; follow the link for the full details.
    https://support.pivotal.io/hc/communities/public/questions/205239098-Unable-to-set-shared-buffers-to-2-GB

    Michael,

    While pg_tune is a good utility for a standalone database, it does not apply to the MPP setup used by Greenplum, which needs overhead to operate multiple standalone databases communicating over a private network.
    You cannot set shared_buffers to a higher value because it is stored as an integer, which caps out at 2 GB; anything larger would overflow.

    For tuning, you really want to focus on gp_vmem_protect_limit, statement_mem, and the resource queues.
    And while you want to squeeze every bit of performance out of your cluster, you need to leave room for overhead and for work that isn't counted in the memory usage simply because Greenplum or the OS has to do it -- like unpacking the plan.

    An OS setting of vm.overcommit_memory = 2 with the default vm.overcommit_ratio of 50 (i.e. 0.5) limits the amount of available OS virtual memory to: ( swap + ( vm.overcommit_ratio * RAM ) )
    In your case, with 64 GB of RAM per segment host, that is a per-host maximum of ( swap + ( 64 * 0.5 ) ) GB.

    Your gp_vmem_protect_limit should be set to: ( OS Virtual Memory * 0.9 / Number of Primary Segments Per Server )
    This leaves 10% overhead, but does not account for a saturated system with failover... so you could set it for less for stability.
    For example, if you have mirroring enabled (recommended) and you have failover, one or more nodes would now be carrying one or more additional primary segments -- so if you were tuned for 4 primaries per node and failed over under heavy load, you increase the possibility of the segments running out of memory.
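
    To make the arithmetic concrete, here is a minimal Python sketch of the two formulas above; the swap size and the number of primary segments per host are assumed values, so plug in your own numbers:

        # Illustrative arithmetic only; swap size and segment count are assumptions.
        ram_gb = 64              # physical RAM per segment host (from this thread)
        swap_gb = 32             # assumed swap size -- use your host's actual value
        overcommit_ratio = 0.5   # vm.overcommit_ratio = 50 with vm.overcommit_memory = 2
        primaries_per_host = 4   # assumed number of primary segments per host

        # Available OS virtual memory under vm.overcommit_memory = 2:
        os_virtual_memory_gb = swap_gb + overcommit_ratio * ram_gb   # 32 + 32 = 64 GB

        # gp_vmem_protect_limit per primary segment (the GUC is in MB), leaving ~10% overhead:
        gp_vmem_protect_limit_mb = int(os_virtual_memory_gb * 0.9 / primaries_per_host * 1024)
        print(gp_vmem_protect_limit_mb)   # 14745 for these assumed numbers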

    Your statement_mem should be set to:
    ( gp_vmem_protect_limit / number of concurrent queries allowed by the resource queues )
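
    Continuing the Python sketch above, statement_mem follows directly from that division; the concurrency limit here is an assumed value:

        # Continuing the previous example; the concurrency limit is an assumption.
        gp_vmem_protect_limit_mb = 14745   # from the calculation above
        max_concurrent_queries = 20        # e.g. total ACTIVE_STATEMENTS across your resource queues (assumed)

        statement_mem_mb = gp_vmem_protect_limit_mb // max_concurrent_queries
        print(statement_mem_mb)            # ~737 MB per statement for these assumed numbers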

    I would recommend leaving the default setting for any non-custom queue, which leaves you room to bump it up for queues with massive queries -- though you can also simply set it at runtime before the query as necessary: set statement_mem=xxx; select....

    These settings safeguard your system from virtual memory crashes -- if they are set up correctly, the resource queues will prevent queries that would exceed your gp_vmem limit, your gp_vmem limit should protect your OS from running out of virtual memory, and your OS virtual memory settings should prevent your OS from running out of memory.

    Correctly tuned, they allow your queries to run with optimal performance with overhead to manage all the moving components.

    GPDB will always try to use 100% of its resources, unless otherwise capped by maximums on the resource queues.
    To get a better understanding of this, take a look at this short video, which does a very good job of explaining how they work effectively:
    https://www.youtube.com/watch?v=1b0mHsT_woU

    Hope this helps --

    2019-07-17 18:23:51
  • Public welfare is a lifelong commitment. I am digoal, just do it. Alibaba Cloud database team; specializes in PolarDB, PostgreSQL, DuckDB, ADB, etc., and has long been committed to advancing open-source database technology and its ecosystem in China and to cultivating open-source talent. Former recipient of Alibaba's Qilin Evangelist title and a 2018 OSCAR Open Source Pinnacle Figure.

    Hostname mismatch.

    2019-07-17 18:23:51
  • sddd

    2019-07-17 18:23:51