[hbase10] Still too many threads #691. #692

saintstack · 2016-04-08T23:19:21Z

Fix broken synchronization meant to guard against over-creation
of cluster connection instances on startup when multiple threads start at once.


saintstack · 2016-04-08T23:30:00Z

#691 has a write-up of what this pull request addresses.

manolama · 2016-04-09T00:18:37Z

hbase10/src/main/java/com/yahoo/ycsb/db/HBaseClient10.java

@@ -166,7 +176,9 @@ public void init() throws DBException {
    String table = com.yahoo.ycsb.workloads.CoreWorkload.table;
    try {
      final TableName tName = TableName.valueOf(table);
-      connection.getTable(tName).getTableDescriptor();


Ah so getTable() isn't thread safe in that it could create more connection objects?

I thought Connection was supposed to be threadsafe? Is this essentially a bug in the implementation? Any idea if it still exists in later HBase client versions?

I'm happy to demonstrate a workaround other applications need to be aware of, but I'd like to make sure we document why we're being more conservative than the HBase docs indicate we need to be.

saintstack · 2016-04-10T05:37:15Z

@manolama The root issue is that when lots of threads are starting up, they all see the static 'connection' as null, so each creates a new one, but only one of them prevails... meanwhile a bunch of unreferenced connections are just hanging out (along w/ their zk client threads). The synchronization here on connection.getTable is just my being consistent synchronizing all use of connection so threads see the connection probably made by another and don't get a NPE on connection; probably gratuitous (getTable is safe).

@busbey connection is thread safe. This 'fix' is to the ycsb HBaseClient's attempt to keep a singleton instance of a connection. It was synchronizing on an Integer that it then changed inside the synchronized block... and the synchronization on the threadCount Integer was supposed to be 'protecting' connection initialization. It's broken.
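
For readers who have not hit this before, here is a minimal, self-contained sketch of why synchronizing on an `Integer` that gets reassigned inside the block protects nothing. The class and method names are made up for illustration and are not taken from HBaseClient10; only the shape of the boxed counter field mirrors the diff in this pull request.

```java
import java.util.concurrent.atomic.AtomicInteger;

class BoxedLockDemo {
  // Same shape as the old field: a boxed Integer used as both counter and lock.
  private static Integer boxedCount = 0;

  // Alternative: a final AtomicInteger whose reference never changes.
  private static final AtomicInteger ATOMIC_COUNT = new AtomicInteger(0);

  /** True if the object we locked is still what the field refers to after the increment. */
  static boolean boxedLockSurvivesIncrement() {
    final Integer lock = boxedCount;
    synchronized (lock) {
      boxedCount++; // unbox, add one, rebox: the field now refers to a different Integer
    }
    return lock == boxedCount; // reference comparison on purpose
  }

  /** Same check for the AtomicInteger, which is mutated in place. */
  static boolean atomicLockSurvivesIncrement() {
    final AtomicInteger lock = ATOMIC_COUNT;
    synchronized (lock) {
      ATOMIC_COUNT.incrementAndGet(); // value changes, reference does not
    }
    return lock == ATOMIC_COUNT;
  }

  public static void main(String[] args) {
    System.out.println("Integer lock unchanged: " + boxedLockSurvivesIncrement());
    System.out.println("AtomicInteger lock unchanged: " + atomicLockSurvivesIncrement());
  }
}
```

The first check comes back `false`: a thread arriving after the increment would lock a different object than the thread still inside the block, so the two do not exclude each other. The second comes back `true` because the `AtomicInteger` reference is never reassigned.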

busbey · 2016-04-10T05:40:40Z

oh! the autoboxing of the integer changed the reference. of course. /me slaps forehead.

guess next time I won't advocate moving to something "simpler" than the Atomic classes. :/

saintstack · 2016-04-10T05:47:53Z


Looking at hbase098, it is synchronizing on the AtomicInteger which would
do-the-right-thing. Should I make this patch do same as it?

busbey · 2016-04-10T05:49:11Z

> The synchronization here on connection.getTable is just my being consistent synchronizing all use of connection so threads see the connection probably made by another and don't get a NPE on connection; probably gratuitous (getTable is safe).

This part shouldn't be needed, since we are synchronized within init and cleanup on changing the null/not-null state of connection.

If we leave the synchronized block in here, then we'll contend on the lock across all the threads whenever the table changes. With the current core workloads that will mean the first operation each thread does. Maybe not a killer since it's just at startup, but it will be ugly since it'll be each thread's init + each thread's first operation all hitting the same mutex.

busbey · 2016-04-10T05:49:22Z

hbase10/src/main/java/com/yahoo/ycsb/db/HBaseClient10.java

-  // Must be an object for synchronization and tracking running thread counts. 
-  private static Integer threadCount = 0;
+
+  private static AtomicInteger threadCount = new AtomicInteger(0);


should be final.
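
A small sketch of the pattern under discussion: one shared resource guarded by a `final` lock object that also tracks the running thread count. `SharedConnectionDemo`, `FakeConnection`, and `connectionsCreatedBy` are hypothetical names standing in for the real HBase `Connection` and the client's `init`/`cleanup`; this is not the code from the patch or from hbase098.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.atomic.AtomicInteger;

class SharedConnectionDemo {
  /** Stand-in for an expensive, thread-safe resource such as a cluster connection. */
  static final class FakeConnection {
    static final AtomicInteger CREATED = new AtomicInteger();
    FakeConnection() { CREATED.incrementAndGet(); }
    void close() { /* nothing to release in this sketch */ }
  }

  // final: the monitor can never be swapped for another object.
  private static final AtomicInteger THREAD_COUNT = new AtomicInteger(0);
  private static FakeConnection connection; // guarded by THREAD_COUNT

  static void init() {
    synchronized (THREAD_COUNT) {
      THREAD_COUNT.incrementAndGet();
      if (connection == null) {
        connection = new FakeConnection(); // only the first thread gets here
      }
    }
  }

  static void cleanup() {
    synchronized (THREAD_COUNT) {
      // The last thread out closes the shared resource.
      if (THREAD_COUNT.decrementAndGet() == 0 && connection != null) {
        connection.close();
        connection = null;
      }
    }
  }

  /** Starts n threads that init, wait for each other, then clean up; returns connections created. */
  static int connectionsCreatedBy(int n) {
    int before = FakeConnection.CREATED.get();
    CountDownLatch allInitialized = new CountDownLatch(n);
    Thread[] workers = new Thread[n];
    for (int i = 0; i < n; i++) {
      workers[i] = new Thread(() -> {
        init();
        allInitialized.countDown();
        try {
          allInitialized.await(); // keep every thread alive until all have initialized
        } catch (InterruptedException e) {
          Thread.currentThread().interrupt();
        }
        cleanup();
      });
      workers[i].start();
    }
    try {
      for (Thread t : workers) {
        t.join();
      }
    } catch (InterruptedException e) {
      Thread.currentThread().interrupt();
      throw new IllegalStateException(e);
    }
    return FakeConnection.CREATED.get() - before;
  }

  public static void main(String[] args) {
    System.out.println("connections created: " + connectionsCreatedBy(16));
  }
}
```

With the lock reference fixed, sixteen threads starting together create exactly one `FakeConnection`. Marking the field `final` makes the compiler reject any later reassignment of the lock.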

busbey · 2016-04-10T05:52:24Z

> Looking at hbase098, it is synchronizing on the AtomicInteger which would do-the-right-thing. Should I make this patch do same as it?

That's what this client did before #651. I don't really mind either way. I suppose it's nice for the implementations to be similar?

cmatser · 2016-04-10T06:43:50Z

👍

manolama · 2016-04-10T18:07:49Z

@saintstack Ah thank you! That damn autoboxing got me so it makes sense now.

cmatser · 2016-04-11T18:31:47Z

I'm tracking this issue for the release. Can you tell me what the next steps are before this can be merged? Thanks!

saintstack · 2016-04-11T18:39:42Z

I'm running the patch locally. Works for me. It will be different locking from what is in the hbase098 and hbase094 clients but it was different before I showed up. There is one +1 above. Need more?

busbey · 2016-04-11T21:53:44Z

the threadcount variable should be final. but I can clean that up and make things look like hbase098 in a follow-on.

busbey · 2016-04-11T21:56:47Z

shoot. just reread things more carefully and saw my comment about synchronizing around getTable.

probably fine? I'll add it to the follow-on. someone let me know if they think we should revert and fix now.

saintstack · 2016-04-11T22:21:27Z

Fine I'd say @busbey ... just startup. And we're synchronizing anyways.

Thanks for merge.


[hbase10] Still too many threads brianfrankcooper#691.

57e1ab5



busbey mentioned this pull request Apr 10, 2016

Release version 0.8.0 #678

Closed

busbey mentioned this pull request Apr 10, 2016

[googlebigtable] race condition in initialization caused by autoboxing #697

Closed

busbey merged commit 604c50d into brianfrankcooper:master Apr 11, 2016

busbey mentioned this pull request Apr 11, 2016

[hbase10] refactor thread counting to match hbase098 #701

Closed

cmatser pushed a commit that referenced this pull request Apr 11, 2016

[hbase10] Still too many threads #691.

5fd0858


risdenk mentioned this pull request Apr 12, 2016

[hbase10] Still too many threads #691

Closed

ghost mentioned this pull request May 5, 2016

Release version 0.9.0 #740

Closed
