Disk quota exceeded

Apache's JIRA Issue Tracker | Ramnatthan Alagappan | 8 months ago
  1. 0

    A ZooKeeper cluster completely stalls, with *no* transactions making progress, when the current *leader* encounters a storage-related error (such as *ENOSPC, EDQUOT, EIO*). Surprisingly, in some circumstances the same errors cause the node to crash completely, which allows other nodes in the cluster to become the leader and make progress with transactions. Interestingly, if the same errors are encountered while initializing a new log file, the current leader enters a weird state: it does not crash, it still thinks it is the leader, and so it does not allow others to become the leader. *This causes the entire cluster to freeze.*

    Here is the stacktrace of the leader:

    ------------------------------------------------
    2016-07-11 15:42:27,502 [myid:3] - INFO [SyncThread:3:FileTxnLog@199] - Creating new log file: log.200000001
    2016-07-11 15:42:27,505 [myid:3] - ERROR [SyncThread:3:ZooKeeperCriticalThread@49] - Severe unrecoverable error, from thread : SyncThread:3
    Disk quota exceeded
    at Method)
    at
    at
    at
    at org.apache.zookeeper.server.persistence.FileTxnLog.append(
    at org.apache.zookeeper.server.persistence.FileTxnSnapLog.append(
    at org.apache.zookeeper.server.ZKDatabase.append(
    at
    ------------------------------------------------

    From the trace and the code, it looks like the problem happens only when a new log file is initialized, and only when the error occurs in one of two cases:

      1. Error during the append of the *log header*.
      2. Error during *padding zero bytes to the end of the log*.

    If similar errors happen when writing other blocks of data, the node simply crashes, allowing another node to be elected as the new leader. These two blocks of the newly created log file are special because they take a different error recovery code path: the node does not crash completely. Instead, certain threads are killed, but the quorum-holding thread supposedly stays up, thereby preventing others from becoming the new leader. As a result, the other nodes think that there is no problem with the leader, yet the cluster becomes unavailable for any subsequent operations such as reads and writes.

    Apache's JIRA Issue Tracker | 8 months ago | Ramnatthan Alagappan Disk quota exceeded
  2. 0

    AppScale startup hangs when there is no disk space left

    GitHub | 4 years ago | jovanchohan No space left on device
  3. 0

    The disk that ZooKeeper was using filled up. During a snapshot write, I got the following exception:

    2013-01-16 03:11:14,098 - ERROR [SyncThread:0:SyncRequestProcessor@151] - Severe unrecoverable error, exiting
    No space left on device
    at Method)
    at
    at
    at
    at org.apache.zookeeper.server.persistence.FileTxnLog.commit(
    at org.apache.zookeeper.server.persistence.FileTxnSnapLog.commit(
    at org.apache.zookeeper.server.ZKDatabase.commit(
    at org.apache.zookeeper.server.SyncRequestProcessor.flush(
    at

    Then many subsequent exceptions like:

    2013-01-16 15:02:23,984 - ERROR [main:Util@239] - Last transaction was partial.
    2013-01-16 15:02:23,985 - ERROR [main:ZooKeeperServerMain@63] - Unexpected exception, exiting abnormally
    at
    at org.apache.jute.BinaryInputArchive.readInt(
    at org.apache.zookeeper.server.persistence.FileHeader.deserialize(
    at org.apache.zookeeper.server.persistence.FileTxnLog$FileTxnIterator.inStreamCreated(
    at org.apache.zookeeper.server.persistence.FileTxnLog$FileTxnIterator.createInputArchive(
    at org.apache.zookeeper.server.persistence.FileTxnLog$FileTxnIterator.goToNextLog(
    at org.apache.zookeeper.server.persistence.FileTxnLog$
    at org.apache.zookeeper.server.persistence.FileTxnLog$FileTxnIterator.init(
    at org.apache.zookeeper.server.persistence.FileTxnLog$FileTxnIterator.<init>(
    at
    at org.apache.zookeeper.server.persistence.FileTxnSnapLog.restore(
    at org.apache.zookeeper.server.ZKDatabase.loadDataBase(
    at org.apache.zookeeper.server.ZooKeeperServer.loadData(
    at org.apache.zookeeper.server.ZooKeeperServer.startdata(
    at org.apache.zookeeper.server.NIOServerCnxnFactory.startup(
    at org.apache.zookeeper.server.ZooKeeperServerMain.runFromConfig(
    at org.apache.zookeeper.server.ZooKeeperServerMain.initializeAndRun(
    at org.apache.zookeeper.server.ZooKeeperServerMain.main(
    at org.apache.zookeeper.server.quorum.QuorumPeerMain.initializeAndRun(
    at org.apache.zookeeper.server.quorum.QuorumPeerMain.main(

    It seems to me that writing the transaction log should be fully atomic to avoid such situations. Is this not the case?

    Apache's JIRA Issue Tracker | 4 years ago | David Arthur No space left on device
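
The third report asks whether transaction log writes should be atomic. One common way to tolerate a write that was cut short (for example by a full disk) is to frame each record with its length and a checksum, and to have the reader stop at the first record that is incomplete or fails the check. The following is a minimal sketch of that general technique only. It is not ZooKeeper's on-disk format or recovery code, and every name in it (`TornWriteSketch`, `frame`, `readValid`, `concat`) is hypothetical.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.zip.CRC32;

// Hypothetical sketch: length + CRC32 framing so a reader can detect a torn tail.
class TornWriteSketch {
    // 4-byte length followed by an 8-byte checksum value.
    static final int HEADER_BYTES = Integer.BYTES + Long.BYTES;

    // Frame one payload as [length][crc32][payload].
    static byte[] frame(byte[] payload) {
        CRC32 crc = new CRC32();
        crc.update(payload, 0, payload.length);
        return ByteBuffer.allocate(HEADER_BYTES + payload.length)
                .putInt(payload.length)
                .putLong(crc.getValue())
                .put(payload)
                .array();
    }

    // Return every record up to (not including) the first incomplete or corrupt one.
    static List<String> readValid(byte[] log) {
        List<String> records = new ArrayList<>();
        ByteBuffer in = ByteBuffer.wrap(log);
        while (in.remaining() >= HEADER_BYTES) {
            int len = in.getInt();
            long expected = in.getLong();
            // Assumption of this sketch: a non-positive length marks the end of
            // the log, and a length larger than what is left means a torn write.
            if (len <= 0 || len > in.remaining()) {
                break;
            }
            byte[] payload = new byte[len];
            in.get(payload);
            CRC32 crc = new CRC32();
            crc.update(payload, 0, len);
            if (crc.getValue() != expected) {
                break; // payload bytes do not match the recorded checksum
            }
            records.add(new String(payload, StandardCharsets.UTF_8));
        }
        return records;
    }

    // Concatenate two byte arrays (stands in for appending to a file).
    static byte[] concat(byte[] a, byte[] b) {
        byte[] out = Arrays.copyOf(a, a.length + b.length);
        System.arraycopy(b, 0, out, a.length, b.length);
        return out;
    }

    public static void main(String[] args) {
        byte[] full = concat(
                frame("create /a".getBytes(StandardCharsets.UTF_8)),
                frame("create /b".getBytes(StandardCharsets.UTF_8)));
        // Simulate a write that ran out of space three bytes before the end.
        byte[] torn = Arrays.copyOf(full, full.length - 3);
        System.out.println(readValid(full));
        System.out.println(readValid(torn));
    }
}
```

With the truncated copy, the reader returns only the first record instead of failing, which is the behavior one would want on restart after a partial last transaction. Whether a given ZooKeeper version behaves this way for a partially written file header is exactly what the report questions, so this sketch should not be read as a description of it.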


Root Cause Analysis


    Disk quota exceeded

  2. Java RT
    1. Method)
    4 frames
  3. ZooKeeper
    1. org.apache.zookeeper.server.persistence.FileTxnLog.append(
    2. org.apache.zookeeper.server.persistence.FileTxnSnapLog.append(
    3. org.apache.zookeeper.server.ZKDatabase.append(
    4 frames
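
The first report above distinguishes two outcomes of a storage error in the sync thread: the whole node stops (so another leader can be elected), or only some threads die while the node keeps claiming leadership (so the cluster freezes). The difference can be illustrated with a small simulation. This is a hedged sketch of the fail-stop idea, not ZooKeeper's code; the `Node` class, its flags, and `runSyncThread` are all hypothetical names invented for the illustration.

```java
import java.io.IOException;
import java.util.concurrent.atomic.AtomicBoolean;

// Hypothetical simulation of "thread dies, node lives" versus fail-stop.
class FailStopSketch {
    // Stand-in for a server process; "running" models still holding leadership.
    static final class Node {
        final AtomicBoolean running = new AtomicBoolean(true);
        final AtomicBoolean syncThreadAlive = new AtomicBoolean(true);
    }

    // A unit of log work that may fail with a storage error.
    interface LogStep {
        void run() throws IOException;
    }

    // Run one log step on a worker thread and apply the chosen error policy.
    static Node runSyncThread(LogStep step, boolean failStop) {
        Node node = new Node();
        Thread sync = new Thread(() -> {
            try {
                step.run();
            } catch (IOException e) {
                // The worker cannot continue after a storage error.
                node.syncThreadAlive.set(false);
                if (failStop) {
                    // Fail-stop policy: take the whole node down so that
                    // peers notice and can elect a new leader.
                    node.running.set(false);
                }
            }
        }, "SyncThread-sketch");
        sync.start();
        try {
            sync.join();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        return node;
    }

    // The frozen state from the report: node still up, but its log writer is gone.
    static boolean isStalled(Node node) {
        return node.running.get() && !node.syncThreadAlive.get();
    }

    public static void main(String[] args) {
        LogStep failing = () -> {
            throw new IOException("Disk quota exceeded");
        };
        System.out.println("thread-only failure stalled: "
                + isStalled(runSyncThread(failing, false)));
        System.out.println("fail-stop stalled: "
                + isStalled(runSyncThread(failing, true)));
    }
}
```

In the sketch, the thread-only policy leaves the node in the stalled state, while the fail-stop policy does not, because the node itself goes down. The report suggests that errors while writing the log header or the zero padding follow something like the first policy, and errors elsewhere follow something like the second.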