KMeans clustering #403

Open
@ZhenNan2016

Description

I have a few questions about SPANN:

  1. In KMeans clustering, what is the size limit for each cluster? If a cluster exceeds this limit, is it re-divided, and if so, into one layer or multiple layers?
  2. After clustering is completed, when do the centroids held in memory need to be updated?
  3. After clustering is completed, is new vector data written directly into the posting lists on disk, or stored as centroids in memory?
  4. When is KMeans clustering run again?
  5. If there are too many clusters, are the clusters themselves clustered again with KMeans?
  6. What is the difference between SPTAG and SPTAG++?
  7. A question about hierarchical data partition and partial search:
    Does each query require two steps: 1) distributed dispatch and 2) local search?
    What operations are involved in each of these two steps?
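
To make question 1 concrete, the kind of re-division being asked about could look like the sketch below. It is an illustration under stated assumptions only, not SPTAG's or SPANN's actual implementation: the names `kmeans`, `assign`, `split_until_small`, and `max_cluster_size` are hypothetical, the k-means is plain Lloyd's iteration written with NumPy, and the fallback of cutting a cluster in half is a simplification chosen so the recursion always terminates.

```python
import numpy as np


def assign(points, centroids):
    """Return, for each point, the index of its nearest centroid (L2 distance)."""
    dists = np.linalg.norm(points[:, None, :] - centroids[None, :, :], axis=2)
    return dists.argmin(axis=1)


def kmeans(points, k, iters=10, seed=0):
    """Plain Lloyd's k-means (illustrative only). Returns (centroids, labels)."""
    rng = np.random.default_rng(seed)
    # Start from k distinct input points.
    centroids = points[rng.choice(len(points), size=k, replace=False)].copy()
    for _ in range(iters):
        labels = assign(points, centroids)
        for j in range(k):
            members = points[labels == j]
            if len(members) > 0:  # keep the old centroid if a cluster is empty
                centroids[j] = members.mean(axis=0)
    return centroids, assign(points, centroids)


def split_until_small(points, ids, max_cluster_size, k=2, seed=0):
    """Recursively split a cluster until each leaf holds at most
    max_cluster_size vectors. Returns a list of (centroid, member_ids)."""
    if len(ids) <= max_cluster_size:
        return [(points[ids].mean(axis=0), ids)]
    _, labels = kmeans(points[ids], min(k, len(ids)), seed=seed)
    groups = [ids[labels == j] for j in range(k)]
    groups = [g for g in groups if len(g) > 0]
    if len(groups) < 2:
        # Degenerate split (for example, all points identical): cut in half
        # so that each recursive call still works on a strictly smaller set.
        half = len(ids) // 2
        groups = [ids[:half], ids[half:]]
    leaves = []
    for g in groups:
        leaves.extend(split_until_small(points, g, max_cluster_size, k, seed + 1))
    return leaves


# Hypothetical usage: 200 random 8-dimensional vectors, at most 32 per cluster.
rng = np.random.default_rng(42)
points = rng.normal(size=(200, 8))
leaves = split_until_small(points, np.arange(200), max_cluster_size=32)
print(len(leaves), max(len(ids) for _, ids in leaves))
```

In this sketch an oversized cluster is split as many times as needed, so the result is a multi-level hierarchy whose leaves all respect the limit. Whether SPANN uses a fixed limit, a single re-split, or several levels is exactly what the question asks.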

Looking forward to your reply.
Thanks very much.
