VMware Tanzu Greenplum

 interconnect encountered a network error, please check your network

J Z posted Apr 29, 2023 10:45 PM

We have installed Greenplum Database server version 6.24.2.

We have configured one standalone master node and 18 segment nodes.

 

Sometimes an SQL statement fails with the error below, but the same statement may succeed when executed again.

 

The error is:

interconnect encountered a network error, please check your network (seg8 slice11 192.168.1.210:40008 pid=1627342). Failed to send packet(seq 1) to 192.168.1.211:25366(pid 1630839 cid 12) after 303 retries in 300 seconds.

 

We have already adjusted OS parameter settings such as ipfrag_high_thresh, ipfrag_low_thresh, and ipfrag_time.
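
For reference, the kind of sysctl changes we made look roughly like the following; the values shown are only illustrative, not necessarily the exact ones we applied:

# Illustrative IP fragment reassembly settings (example values, not our exact configuration)
sysctl -w net.ipv4.ipfrag_high_thresh=41943040
sysctl -w net.ipv4.ipfrag_low_thresh=31457280
sysctl -w net.ipv4.ipfrag_time=60
# Persisted by adding the same keys to /etc/sysctl.conf and reloading with: sysctl -p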

 

We used iperf3 to test UDP transmission (iperf3 -c xx.xx.xx.xx -u -b 5000M -f M -t 50), and the loss rate is about 0.5%.
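
The matching iperf3 server was running on the receiving host during the test; roughly like this (the IP address is a placeholder):

# On the receiving segment host
iperf3 -s
# On the sending host
iperf3 -c xx.xx.xx.xx -u -b 5000M -f M -t 50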

Kevin Huang

This error is a fairly generic message that suggests some issue with the network. However, a specific query can intermittently fail, as in your case, because it requires an extra Broadcast or Redistribute Motion.

 

1.) Is 192.168.1.211:25366 another segment host? If so, this suggests that the motion (where data is sent from one segment to another) is failing.

2.) Is your setup on bare metal or is it on a cloud provider like AWS or Azure?

3.) You can check whether packets are being dropped using netstat -i on 192.168.1.210 and 192.168.1.211 (see the example just after this list). Another question: does it always fail on those two segment hosts?

4.) Are there a lot of queries running when you get these error messages?
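
As a rough sketch of what I mean in 3.), run something like this on both hosts (interface names and exact counter labels vary by OS):

# Per-interface counters; watch the RX-DRP / RX-ERR and TX-DRP / TX-ERR columns
netstat -i
# Protocol-level counters; fragment reassembly failures show up here
netstat -s | grep -i -E 'reassembl|fragment'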

 

Hope that helps. If none of those really narrow it down, I would suggest opening a support case so they can dig further.

J Z

Thank you for your answer.

 

1.) 192.168.1.211 is another segment host.

2.) It is on bare metal.

3.) Failures also occur on other segment hosts. And, as I mentioned, the failure is intermittent; sometimes the same query succeeds. netstat -s shows some 'packet reassembles failed' errors.

4.) Not many queries are running.

 

What puzzles me is that even though UDP does lose some packets, there shouldn't be anything like '303 retries in 300 seconds', right?

Kevin Huang

I would check those values against these two GUCs:

gp_interconnect_transmit_timeout and gp_interconnect_min_retries_before_timeout
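
A quick way to see what those are currently set to, as a sketch assuming you run it as gpadmin on the master host and that both GUCs are visible on your build:

# Report the current cluster-wide setting of each GUC
gpconfig -s gp_interconnect_transmit_timeout
gpconfig -s gp_interconnect_min_retries_before_timeout
# Or from a psql session:
# SHOW gp_interconnect_transmit_timeout;
# SHOW gp_interconnect_min_retries_before_timeout;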

 

This GitHub issue may explain the retries value and how it increments for a similar problem. Just to be clear, it may not necessarily be the same issue you are facing: https://github.com/greenplum-db/gpdb/issues/12961