Thursday, May 15, 2014

How to shorten Hbase MTTR

Refer to Introduction to HBase Mean Time to Recovery (MTTR):

In short:

1. Detecting node failures

change hbase.zookeeper.timeout to 60000 in hbase-site.xml

2. Recovering in-progress writes

hdfs-site.xml
><!-- stale mode - 1.2+ -->

<property>
   <name>dfs.namenode.avoid.read.stale.datanode</name>
   <value>true</value>
</property>

<property>
   <name>dfs.namenode.avoid.write.stale.datanode</name>
   <value>true</value>
</property>

<property>
   <name>dfs.namenode.write.stale.datanode.ratio</name>
   <value>1.0f</value>
</property>

<!-- stale mode - branch 1.1.1+ -->

<property>
   <name>dfs.namenode.check.stale.datanode</name>
   <value>true</value>
</property>

No comments:

Post a Comment

Popular Posts