
Synchronized old 0.8 branch with master and Kafka 0.8.0-beta1 #51


Closed
wants to merge 93 commits

Conversation

@mrtheb (Collaborator) commented Oct 1, 2013

What this contains:

  • Updated the Kafka reference to the 0.8.0-beta1 commit (looks dirty, but it works)
  • Disabled tests that commit offsets to Kafka (not available in 0.8; maybe in trunk)
  • Updated the docs: Kafka now requires running ./sbt assembly-package-dependency
  • Updated tests and fixtures
  • Moved the integration test for the blocking API into a separate test function, as it was causing me trouble. It also makes more sense to have it split.

Integration tests pass on my machine. I used both tox (with pytest) and nosetests directly (which I prefer over pytest); not tested on Python 2.6, though.

@mahendra, @mumrah mrtheb/kafka-python@2b016b6 is for a fix you probably want in master as well.

mumrah and others added 30 commits April 11, 2013 16:03
This will be easier to use in cases where we need to fetch only a
specified number of messages. The API uses the __iter__ machinery
internally, but maintains state so it gives back only the requested
set of messages.

The API is get_messages(count=1)
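The idea described above, a get_messages() call layered on shared iterator state, can be sketched as follows. This is a toy model with hypothetical names, not the library's actual consumer:

```python
from itertools import islice

class IterBackedConsumer:
    """Toy consumer: get_messages() reuses the __iter__ state, so
    successive calls resume where the previous one stopped."""

    def __init__(self, messages):
        self._iter = iter(messages)   # shared iterator state

    def __iter__(self):
        return self._iter

    def get_messages(self, count=1):
        # islice pulls at most `count` items from the shared iterator,
        # so the next call continues with the following message
        return list(islice(self._iter, count))

consumer = IterBackedConsumer(["m1", "m2", "m3"])
print(consumer.get_messages(count=2))  # ['m1', 'm2']
print(consumer.get_messages(count=2))  # ['m3']
```

Because both get_messages() and plain iteration drain the same iterator, mixing the two never replays a message.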
The auto-commit timer is one-shot: after the first commit it never
fires again. This fixes that issue.

Also cleaned up some duplicate code in util.ReentrantTimer()
Auto commit timer is not periodic
Removed get_messages API, added test for get_pending
Conflicts:
	kafka/consumer.py
* When you initiate a producer with a non-existent queue, the queue is
  created. However, this partition info is not reflected in KafkaClient()
  immediately, so we wait one second and try loading it again.

  Without this fix, calling producer.send_messages() right after creating
  a new queue makes the library throw a StopIteration exception.

* In SimpleConsumer(), the defaults are not as documented in the comments.
  Fix this (or do we change the documentation?)

* There was a problem with the way the consumer iterator worked.
  For example, assume there are 10 messages in the queue/topic
  and you iterate over them as:

  for msg in consumer:
       print(msg)

  At the end of this, the saved 'offset' is 10, so if you run the
  loop again, the last message (10) is repeated.

  This can be fixed by adjusting the offset counter before fetching
  the message

* Avoid some code repeat in consumer.commit()

* Fix a bug in send_offset_commit_request() invocation in consumer.py

* Fix missing imports
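The iterator off-by-one described above comes down to when the offset counter is advanced relative to the fetch. A toy model of the fixed behavior (hypothetical names, not SimpleConsumer itself):

```python
class ToyConsumer:
    """Toy log consumer: `offset` always points at the NEXT message
    to fetch. Each iteration fetches at the current offset and only
    then advances it, so re-entering the loop after exhausting the
    log yields nothing instead of replaying the last message."""

    def __init__(self, log):
        self.log = log        # the "topic": a list of messages
        self.offset = 0       # next offset to fetch

    def __iter__(self):
        while self.offset < len(self.log):
            msg = self.log[self.offset]  # fetch at current offset...
            self.offset += 1             # ...then advance
            yield msg

c = ToyConsumer(["a", "b", "c"])
first = list(c)    # ['a', 'b', 'c']
second = list(c)   # [] -- offset is past the end, no repeat
```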
consumer.py and conn.py will be done later after pending merges
This alleviates IPv4 -vs- IPv6 issues in ZK and Kafka.
Is there a better way to do this?
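One common, family-agnostic pattern for this is to resolve the host with socket.getaddrinfo and try each returned address in turn. A stdlib-only sketch (this is an illustration of the approach, not the library's actual connection code):

```python
import socket

def connect(host, port, timeout=5.0):
    """Resolve `host` via getaddrinfo and attempt each candidate
    address, so IPv4 and IPv6 endpoints both work without any
    special-casing in the caller."""
    last_err = None
    for family, socktype, proto, _, addr in socket.getaddrinfo(
            host, port, socket.AF_UNSPEC, socket.SOCK_STREAM):
        sock = socket.socket(family, socktype, proto)
        sock.settimeout(timeout)
        try:
            sock.connect(addr)
            return sock          # first address that answers wins
        except OSError as err:
            last_err = err
            sock.close()
    raise last_err or OSError("getaddrinfo returned no results")
```

Trying every getaddrinfo result in order is essentially what the stdlib's own socket.create_connection helper does.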
Fix auto-commit issues with multi-threading
If there are no messages being consumed, the timer keeps
creating new threads at the specified interval. This may
not be necessary; we can control this behaviour so that
the timer thread is started only when a message is consumed
The previous commit optimized the commit thread such that the timer
started only when there were messages to be consumed. This commit
goes a step further and ensures the following:
* Only one timer thread is created
* The main app does not block on exit (waiting for timer thread to finish)

This is ensured by having a single thread block on an event and
repeatedly call a function. We use events instead of time.sleep()
to prevent the Python interpreter from waking every 50 ms to check
whether the timer has expired (logic copied from threading.Timer)
mahendra and others added 21 commits June 28, 2013 13:59
In the current patch get_messages(count=1) would return zero messages
the first time it is invoked after a consumer was initialized.
Support for async producer

Merged locally, tests pass, +1
Conflicts:
	kafka/__init__.py
	kafka/consumer.py
	test/test_integration.py
Was hard-coded to 1024 bytes, which meant that larger messages were
unconsumable: they would always get split, causing the consumer to
stop.

It would probably be best to automatically retry truncated messages with
a larger request size so you don't have to know your max message size
ahead of time
Related to dpkp#42

Adds new ConsumerFetchSizeTooSmall exception that is thrown when
`_decode_message_set_iter` gets a BufferUnderflowError but has not yet
yielded a message

In this event, SimpleConsumer will increase the fetch size by 1.5 and
continue the fetching loop while _not_ increasing the offset (basically
just retries the request with a larger fetch size)

Once the consumer fetch size has been increased, it will remain
increased while SimpleConsumer fetches from that partition
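The grow-and-retry loop described above can be modeled in a few lines. This is a simplified sketch of the technique, not the actual SimpleConsumer code; the exception and function names here are hypothetical:

```python
class FetchSizeTooSmall(Exception):
    """Raised when the fetch buffer truncates the data before a
    single whole message could be decoded."""

def fetch_with_growth(fetch, fetch_size, max_size=1 << 20):
    """Retry `fetch(size)` with a 1.5x larger buffer whenever it
    reports truncation, without advancing the offset. Returns the
    result and the (possibly grown) size, so the caller can keep
    using the larger size for that partition."""
    while fetch_size <= max_size:
        try:
            return fetch(fetch_size), fetch_size
        except FetchSizeTooSmall:
            fetch_size = int(fetch_size * 1.5)  # grow, retry same offset
    raise FetchSizeTooSmall("message larger than max_size")

# toy fetch: fails until the buffer can hold a 3000-byte message
def toy_fetch(size):
    if size < 3000:
        raise FetchSizeTooSmall()
    return b"x" * 3000

msgs, new_size = fetch_with_growth(toy_fetch, 1024)  # grows 1024 -> 3456
```

Keeping the grown size for subsequent fetches avoids paying the retry cost again the next time a similarly large message appears.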
Allow a client id to be passed to the client

+1 thanks, @jimjh
Also move the exceptions to common instead of util
correct typo in readme example
Small fixes in the ## Multiprocess consumer example.
@mumrah (Collaborator) commented Oct 2, 2013

@mrtheb This pull request looks a little suspect (2,691 additions, 949 deletions). Can you rebase and open a new one?

@mumrah mumrah closed this Oct 2, 2013
@mumrah (Collaborator) commented Oct 2, 2013

@mrtheb I should clarify

From your pull request description, I would expect to see changes to

  • kafka-src revision
  • docs
  • integration tests
  • Kafka fixture

Can you rebase these changes against "master"?

@mrtheb (Collaborator, Author) commented Oct 2, 2013

It's probably because I merged from master locally and pushed the result to the branch... dunno. I'll check it out again. My changes are at the bottom of the list.

@mumrah (Collaborator) commented Oct 3, 2013

@mrtheb I have manually merged your changes. Tests are passing locally; let's see how Travis does.

@mrtheb (Collaborator, Author) commented Oct 3, 2013

@mumrah oh thanks, sorry you had to do that... I honestly never used rebase before; I'm still learning git workflows (we use Perforce at work). I have other pieces coming up. It's good to see all the activity lately.

@mumrah (Collaborator) commented Oct 3, 2013

@mrtheb no worries, I was happy to get the tests passing. A little git-fu is a small price ;)

5 participants