Open
Description
Regarding spann, I would like to ask a few questions, as follows:
- Regarding KMeans clustering, what is the limit for each cluster center? If it exceeds this limit, will it be re divided into one or multiple layers?
- When does the centroid in memory need to be updated after clustering is completed?
- After completing clustering, should the new vector data be written directly into posting list in the disk or stored as centroids in memory?
- When will KMeans clustering be done again?
- There are too many clusters, will they be clustered with KMeans clustering algorithms again?
- What is the difference between sptag and sptag++ ?
- One question about Hierarchical data partition and partial search, as follows:
Does each query require two steps: 1) Distributed dispatch and 2) Local Search?
What are the transactions for these two steps?
Looking forward to your reply.
Thanks very much.
Metadata
Metadata
Assignees
Labels
No labels