[Ruby 1.9 - Bug #5258][Open] SizedQueueにBug #5195と同様のバグ

Issue #5258 has been reported by Masaki M…


Bug #5258: SizedQueueにBug #5195と同様のバグ

Author: Masaki M.
Status: Open
Priority: Normal
Assignee:
Category: lib
Target version: 1.9.x
ruby -v: ruby 1.9.4dev (2011-09-01 trunk 33157) [x86_64-linux]

=begin
[Bug
#5195][ruby-dev:44400]と同様ですが、SizedQueue#pushでsleepしているthreadをwakeupさせると、SizedQueueの@queue_waitにそのthreadがpushされてしまいます。

require ‘thread’

sq = SizedQueue.new(1)
sq.push(0)

t1 = Thread.start { sq.push(1) ; sleep }

nil until t1.stop?
t1.wakeup
nil until t1.stop?

t2 = Thread.start { sq.push(2) }

nil until t1.stop? && t2.stop?

p t1, t2
sq.instance_eval{ p @queue_wait }

3.times{ sq.pop }

t2.join

上記のコードを実行すると、

#<Thread:0x00000000b1a198 sleep>
#<Thread:0x00000000b1a120 sleep>
[#<Thread:0x00000000b1a198 sleep>, #<Thread:0x00000000b1a198 sleep>,
#<Thread:0x00000000b1a120 sleep>]
/usr/local/lib/ruby/1.9.1/thread.rb:185:in sleep': deadlock detected (fatal) from /usr/local/lib/ruby/1.9.1/thread.rb:185:in block in pop’
from internal:prelude:10:in synchronize' from /usr/local/lib/ruby/1.9.1/thread.rb:180:in pop’
from /usr/local/lib/ruby/1.9.1/thread.rb:324:in pop' from sized_queue.rb:19:in block in ’
from sized_queue.rb:19:in times' from sized_queue.rb:19:in

となります。
場当たり的なものですが[Bug
#5195][ruby-dev:44400]の時と同様なpatchを添付します。適用後もtest/thread/test_queue.rbをパスします。

=end

Issue #5258 has been updated by Motohiro KOSAKI.

Status changed from Open to Assigned
Assignee set to Motohiro KOSAKI


Bug #5258: SizedQueueにBug #5195と同様のバグ

Author: Masaki M.
Status: Assigned
Priority: Normal
Assignee: Motohiro KOSAKI
Category: lib
Target version: 1.9.x
ruby -v: ruby 1.9.4dev (2011-09-01 trunk 33157) [x86_64-linux]

=begin
[Bug
#5195][ruby-dev:44400]と同様ですが、SizedQueue#pushでsleepしているthreadをwakeupさせると、SizedQueueの@queue_waitにそのthreadがpushされてしまいます。

require ‘thread’

sq = SizedQueue.new(1)
sq.push(0)

t1 = Thread.start { sq.push(1) ; sleep }

nil until t1.stop?
t1.wakeup
nil until t1.stop?

t2 = Thread.start { sq.push(2) }

nil until t1.stop? && t2.stop?

p t1, t2
sq.instance_eval{ p @queue_wait }

3.times{ sq.pop }

t2.join

上記のコードを実行すると、

#<Thread:0x00000000b1a198 sleep>
#<Thread:0x00000000b1a120 sleep>
[#<Thread:0x00000000b1a198 sleep>, #<Thread:0x00000000b1a198 sleep>,
#<Thread:0x00000000b1a120 sleep>]
/usr/local/lib/ruby/1.9.1/thread.rb:185:in sleep': deadlock detected (fatal) from /usr/local/lib/ruby/1.9.1/thread.rb:185:in block in pop’
from internal:prelude:10:in synchronize' from /usr/local/lib/ruby/1.9.1/thread.rb:180:in pop’
from /usr/local/lib/ruby/1.9.1/thread.rb:324:in pop' from sized_queue.rb:19:in block in ’
from sized_queue.rb:19:in times' from sized_queue.rb:19:in

となります。
場当たり的なものですが[Bug
#5195][ruby-dev:44400]の時と同様なpatchを添付します。適用後もtest/thread/test_queue.rbをパスします。

=end

Issue #5258 has been updated by Tomoyuki C…

File queue.patch added

別件の修正中にパッチを書いていたのですが、このチケットと同じ問題だと気がつきました。
#5195 のぶんも同じですがパッチを添付します。
wakeup や例外で抜けた時に @waiting に残ってしまうと次の Queue#push の時に意図せず Thread#wakeup
が呼ばれてしまったり、実際に待っている Thread に wakeup が呼ばれなくなったりするので、残さないようにしたほうが良いと思います。

Bug #5258: SizedQueueにBug #5195と同様のバグ

Author: Masaki M.
Status: Assigned
Priority: Normal
Assignee: Motohiro KOSAKI
Category: lib
Target version: 1.9.x
ruby -v: ruby 1.9.4dev (2011-09-01 trunk 33157) [x86_64-linux]

=begin
[Bug
#5195][ruby-dev:44400]と同様ですが、SizedQueue#pushでsleepしているthreadをwakeupさせると、SizedQueueの@queue_waitにそのthreadがpushされてしまいます。

require ‘thread’

sq = SizedQueue.new(1)
sq.push(0)

t1 = Thread.start { sq.push(1) ; sleep }

nil until t1.stop?
t1.wakeup
nil until t1.stop?

t2 = Thread.start { sq.push(2) }

nil until t1.stop? && t2.stop?

p t1, t2
sq.instance_eval{ p @queue_wait }

3.times{ sq.pop }

t2.join

上記のコードを実行すると、

#<Thread:0x00000000b1a198 sleep>
#<Thread:0x00000000b1a120 sleep>
[#<Thread:0x00000000b1a198 sleep>, #<Thread:0x00000000b1a198 sleep>,
#<Thread:0x00000000b1a120 sleep>]
/usr/local/lib/ruby/1.9.1/thread.rb:185:in sleep': deadlock detected (fatal) from /usr/local/lib/ruby/1.9.1/thread.rb:185:in block in pop’
from internal:prelude:10:in synchronize' from /usr/local/lib/ruby/1.9.1/thread.rb:180:in pop’
from /usr/local/lib/ruby/1.9.1/thread.rb:324:in pop' from sized_queue.rb:19:in block in ’
from sized_queue.rb:19:in times' from sized_queue.rb:19:in

となります。
場当たり的なものですが[Bug
#5195][ruby-dev:44400]の時と同様なpatchを添付します。適用後もtest/thread/test_queue.rbをパスします。

=end

Issue #5258 has been updated by nobu (Nobuyoshi N.).

It seems natural because the only thread is about to sleep.

Bug #5258: SizedQueueにBug #5195と同様のバグ

Author: Glass_saga (Masaki M.)
Status: Assigned
Priority: Normal
Assignee: kosaki (Motohiro KOSAKI)
Category: lib
Target version: 2.0.0
ruby -v: ruby 1.9.4dev (2011-09-01 trunk 33157) [x86_64-linux]

=begin
[Bug
#5195][ruby-dev:44400]と同様ですが、SizedQueue#pushでsleepしているthreadをwakeupさせると、SizedQueueの@queue_waitにそのthreadがpushされてしまいます。

require ‘thread’

sq = SizedQueue.new(1)
sq.push(0)

t1 = Thread.start { sq.push(1) ; sleep }

nil until t1.stop?
t1.wakeup
nil until t1.stop?

t2 = Thread.start { sq.push(2) }

nil until t1.stop? && t2.stop?

p t1, t2
sq.instance_eval{ p @queue_wait }

3.times{ sq.pop }

t2.join

上記のコードを実行すると、

#<Thread:0x00000000b1a198 sleep>
#<Thread:0x00000000b1a120 sleep>
[#<Thread:0x00000000b1a198 sleep>, #<Thread:0x00000000b1a198 sleep>,
#<Thread:0x00000000b1a120 sleep>]
/usr/local/lib/ruby/1.9.1/thread.rb:185:in sleep': deadlock detected (fatal) from /usr/local/lib/ruby/1.9.1/thread.rb:185:in block in pop’
from internal:prelude:10:in synchronize' from /usr/local/lib/ruby/1.9.1/thread.rb:180:in pop’
from /usr/local/lib/ruby/1.9.1/thread.rb:324:in pop' from sized_queue.rb:19:in block in ’
from sized_queue.rb:19:in times' from sized_queue.rb:19:in

となります。
場当たり的なものですが[Bug
#5195][ruby-dev:44400]の時と同様なpatchを添付します。適用後もtest/thread/test_queue.rbをパスします。

=end

Issue #5258 has been updated by rklemme (Robert K.).

This is also present in 1.9.3 and there is an even simpler test case:

$ ruby19 -r thread -e ‘q=SizedQueue.new 10;1_000_000.times {|i| p
i;q.enq i}’
0
1
2
3
4
5
6
7
8
9
10
/opt/lib/ruby/1.9.1/thread.rb:301:in sleep': deadlock detected (fatal) from /opt/lib/ruby/1.9.1/thread.rb:301:in block in push’
from internal:prelude:10:in synchronize' from /opt/lib/ruby/1.9.1/thread.rb:297:in push’
from -e:1:in block in <main>' from -e:1:in times’
from -e:1:in `’

rklemme@padrklemme2
/cygdrive/c/SCMws/RKlemme/JavaProducts_oslee_ngcp_dev_R3.3_be4rb
$ ruby19 -v
ruby 1.9.3p125 (2012-02-16) [i386-cygwin]

Bug #5258: SizedQueueにBug #5195と同様のバグ

Author: Glass_saga (Masaki M.)
Status: Assigned
Priority: Normal
Assignee: kosaki (Motohiro KOSAKI)
Category: lib
Target version: 2.0.0
ruby -v: ruby 1.9.4dev (2011-09-01 trunk 33157) [x86_64-linux]

=begin
[Bug
#5195][ruby-dev:44400]と同様ですが、SizedQueue#pushでsleepしているthreadをwakeupさせると、SizedQueueの@queue_waitにそのthreadがpushされてしまいます。

require ‘thread’

sq = SizedQueue.new(1)
sq.push(0)

t1 = Thread.start { sq.push(1) ; sleep }

nil until t1.stop?
t1.wakeup
nil until t1.stop?

t2 = Thread.start { sq.push(2) }

nil until t1.stop? && t2.stop?

p t1, t2
sq.instance_eval{ p @queue_wait }

3.times{ sq.pop }

t2.join

上記のコードを実行すると、

#<Thread:0x00000000b1a198 sleep>
#<Thread:0x00000000b1a120 sleep>
[#<Thread:0x00000000b1a198 sleep>, #<Thread:0x00000000b1a198 sleep>,
#<Thread:0x00000000b1a120 sleep>]
/usr/local/lib/ruby/1.9.1/thread.rb:185:in sleep': deadlock detected (fatal) from /usr/local/lib/ruby/1.9.1/thread.rb:185:in block in pop’
from internal:prelude:10:in synchronize' from /usr/local/lib/ruby/1.9.1/thread.rb:180:in pop’
from /usr/local/lib/ruby/1.9.1/thread.rb:324:in pop' from sized_queue.rb:19:in block in ’
from sized_queue.rb:19:in times' from sized_queue.rb:19:in

となります。
場当たり的なものですが[Bug
#5195][ruby-dev:44400]の時と同様なpatchを添付します。適用後もtest/thread/test_queue.rbをパスします。

=end

Hello,

2012/4/11 rklemme (Robert K.) removed_email_address@domain.invalid:

nobu (Nobuyoshi N.) wrote:

It seems natural because the only thread is about to sleep.

I would expect the thread to block indefinitely. Signalling an error here seems
to try to be too smart. Even in absence of other threads I can imagine conditions
under which data is read from the queue (for example a signal handler).

Interesting. The error is indeed “false positive.” I did not noticed
signal handler.

But I’m not enthusiastic for your proposal. It is too conservative.
It will increase too many “false negative.”

But even if an error is signaled here, it is certainly not a deadlock - for that
you need at least two threads. This is rather something like “only blocking thread
is suspended (with no chance to wake up)”. Maybe that error should be controllable
via a switch, e.g.

Thread.single_thread_block_is_error = true # default false

Do you have any practical case where you get bothered by the “false
positive”?
If so, please open a feature request ticket with the use case.

Issue #5258 has been updated by rklemme (Robert K.).

nobu (Nobuyoshi N.) wrote:

It seems natural because the only thread is about to sleep.

I would expect the thread to block indefinitely. Signalling an error
here seems to try to be too smart. Even in absence of other threads I
can imagine conditions under which data is read from the queue (for
example a signal handler).

But even if an error is signaled here, it is certainly not a deadlock -
for that you need at least two threads. This is rather something like
“only blocking thread is suspended (with no chance to wake up)”. Maybe
that error should be controllable via a switch, e.g.

Thread.single_thread_block_is_error = true # default false

Bug #5258: SizedQueueにBug #5195と同様のバグ

Author: Glass_saga (Masaki M.)
Status: Assigned
Priority: Normal
Assignee: kosaki (Motohiro KOSAKI)
Category: lib
Target version: 2.0.0
ruby -v: ruby 1.9.4dev (2011-09-01 trunk 33157) [x86_64-linux]

=begin
[Bug
#5195][ruby-dev:44400]と同様ですが、SizedQueue#pushでsleepしているthreadをwakeupさせると、SizedQueueの@queue_waitにそのthreadがpushされてしまいます。

require ‘thread’

sq = SizedQueue.new(1)
sq.push(0)

t1 = Thread.start { sq.push(1) ; sleep }

nil until t1.stop?
t1.wakeup
nil until t1.stop?

t2 = Thread.start { sq.push(2) }

nil until t1.stop? && t2.stop?

p t1, t2
sq.instance_eval{ p @queue_wait }

3.times{ sq.pop }

t2.join

上記のコードを実行すると、

#<Thread:0x00000000b1a198 sleep>
#<Thread:0x00000000b1a120 sleep>
[#<Thread:0x00000000b1a198 sleep>, #<Thread:0x00000000b1a198 sleep>,
#<Thread:0x00000000b1a120 sleep>]
/usr/local/lib/ruby/1.9.1/thread.rb:185:in sleep': deadlock detected (fatal) from /usr/local/lib/ruby/1.9.1/thread.rb:185:in block in pop’
from internal:prelude:10:in synchronize' from /usr/local/lib/ruby/1.9.1/thread.rb:180:in pop’
from /usr/local/lib/ruby/1.9.1/thread.rb:324:in pop' from sized_queue.rb:19:in block in ’
from sized_queue.rb:19:in times' from sized_queue.rb:19:in

となります。
場当たり的なものですが[Bug
#5195][ruby-dev:44400]の時と同様なpatchを添付します。適用後もtest/thread/test_queue.rbをパスします。

=end

Issue #5258 has been updated by rklemme (Robert K.).

Hi again,

mame (Yusuke E.) wrote:

2012/4/11 rklemme (Robert K.) removed_email_address@domain.invalid:

nobu (Nobuyoshi N.) wrote:

It seems natural because the only thread is about to sleep.

I would expect the thread to block indefinitely. Signalling an error here
seems to try to be too smart. Even in absence of other threads I can imagine
conditions under which data is read from the queue (for example a signal handler).

Interesting. The error is indeed “false positive.” I did not noticed
signal handler.

I was lucky because I was just sensitized by Eric’s recent article about
signal handling in Ruby. :slight_smile:

But I’m not enthusiastic for your proposal. It is too conservative.
It will increase too many “false negative.”

You mean processes with no unblocked threads would go unnoticed? What
about the effort for JRuby and others to mimic this behavior?

But even if an error is signaled here, it is certainly not a deadlock - for
that you need at least two threads. This is rather something like “only blocking
thread is suspended (with no chance to wake up)”. Maybe that error should be
controllable via a switch, e.g.

Thread.single_thread_block_is_error = true # default false

Do you have any practical case where you get bothered by the “false positive”?
If so, please open a feature request ticket with the use case.

Not at the moment. The case I thought I had evaporated because it was a
“silent thread death” case (i.e. Thread.abort_on_exception was at
default - false - and the only other thread had silently died where I
thought it would still be running). But a better error message than
“deadlock” as I have suggested above would have made me find the source
of the issue much more quickly because it had hinted at the real
problem.

I think I’ll open a feature request to at least change the error message
in case there is no other thread left. Btw, is this really a deadlock
detection or does the code only check for blocked threads? From the
code in check_deadlock_i() it is not totally obvious to me; I think the
check only ensures at least one live thread - at least it does not seem
to try to find the deadlock loop (as e.g. some relational databases do).

Kind regards

robert

Bug #5258: SizedQueueにBug #5195と同様のバグ

Author: Glass_saga (Masaki M.)
Status: Assigned
Priority: Normal
Assignee: kosaki (Motohiro KOSAKI)
Category: lib
Target version: 2.0.0
ruby -v: ruby 1.9.4dev (2011-09-01 trunk 33157) [x86_64-linux]

=begin
[Bug
#5195][ruby-dev:44400]と同様ですが、SizedQueue#pushでsleepしているthreadをwakeupさせると、SizedQueueの@queue_waitにそのthreadがpushされてしまいます。

require ‘thread’

sq = SizedQueue.new(1)
sq.push(0)

t1 = Thread.start { sq.push(1) ; sleep }

nil until t1.stop?
t1.wakeup
nil until t1.stop?

t2 = Thread.start { sq.push(2) }

nil until t1.stop? && t2.stop?

p t1, t2
sq.instance_eval{ p @queue_wait }

3.times{ sq.pop }

t2.join

上記のコードを実行すると、

#<Thread:0x00000000b1a198 sleep>
#<Thread:0x00000000b1a120 sleep>
[#<Thread:0x00000000b1a198 sleep>, #<Thread:0x00000000b1a198 sleep>,
#<Thread:0x00000000b1a120 sleep>]
/usr/local/lib/ruby/1.9.1/thread.rb:185:in sleep': deadlock detected (fatal) from /usr/local/lib/ruby/1.9.1/thread.rb:185:in block in pop’
from internal:prelude:10:in synchronize' from /usr/local/lib/ruby/1.9.1/thread.rb:180:in pop’
from /usr/local/lib/ruby/1.9.1/thread.rb:324:in pop' from sized_queue.rb:19:in block in ’
from sized_queue.rb:19:in times' from sized_queue.rb:19:in

となります。
場当たり的なものですが[Bug
#5195][ruby-dev:44400]の時と同様なpatchを添付します。適用後もtest/thread/test_queue.rbをパスします。

=end