Task #10662: orientdb01-d4s.d4science.org keeps crashing badly - D4Science Infrastructure - D4science

Actions

Copy link

Task #10662

closed

orientdb01-d4s.d4science.org keeps crashing badly

Added by Andrea Dell'Amico over 7 years ago. Updated over 7 years ago.

Status:

Closed

Priority:

Urgent

Assignee:

Luca Frosini

Category:

Application

Target version:

No Sprint

Start date:

Dec 12, 2017

Due date:

% Done:

100%

Estimated time:

Infrastructure:

Production

Description

It stops in state that it's impossible for a nagios handler to restart it: the process must be brutally killed. The server runs for 6 days at most and then stops responding, always with the same error:

Error during WAL background flush
java.lang.OutOfMemoryError: Java heap space
java.lang.OutOfMemoryError: GC overhead limit exceeded
java.lang.OutOfMemoryError: GC overhead limit exceeded
Error during WAL background flush
java.lang.OutOfMemoryError: Java heap space
Error during fuzzy checkpoint
java.lang.OutOfMemoryError: GC overhead limit exceeded
java.lang.OutOfMemoryError: GC overhead limit exceeded

Can you investigate what's needed? More heap? a different GC configuration? a newer version? I don't want to spend time killing and restarting the process every few days.

Actions

Copy link

Updated by Luca Frosini over 7 years ago

Status changed from New to In Progress

Searching the error I found this solution:

https://stackoverflow.com/questions/37419616/orientdb-2-1-exception-in-thread-orientdb-wal-flush-task-with-java-lang-outofm

Moreover I found:
https://stackoverflow.com/questions/40013369/orientdb-java-heap-error/
The answer of Oleksandr Gubchenko point to this:
http://orientdb.com/docs/last/Performance-Tuning.html

Actions

Copy link

Updated by Andrea Dell'Amico over 7 years ago

So they say that we need more heap and more memory. The tuning doc seems to have some good advice, you should try something with the dev instances maybe?

Actions

Copy link

Updated by Luca Frosini over 7 years ago

Andrea Dell'Amico wrote:

So they say that we need more heap and more memory.

Yes, please provide 4G if possible

The tuning doc seems to have some good advice, you should try something with the dev instances maybe?

????

Actions

Copy link

Updated by Andrea Dell'Amico over 7 years ago

% Done changed from 0 to 30

I just increased the RAM of the three orientdb production servers to 6GB each. You can comfortably add another GB of heap on each server, and maybe play with the disk buffer parameters to relieve pressure from the memory.

Actions

Copy link