Sharding Use Cases

What specific use cases do we want to address with db partioning (and other techniques) that are challenging to scale? List here for discussion.

  • video site (e.g., youtube) (also, GridFS scale-up)
    • seems straightforward: partition by video
    • for related videos feature, see search below
  • social networking (e.g., facebook)
    • this can be quite hard to partition, because it is difficult to cluster people.
  • very high RPS sites with small datasets
    • N replicas, instead of partioning, might help here
      • replicas only work if the dataset is really small as we are using/wasting the same RAM on each replica. thus, partioning might help us with ram cache efficiency even if entire data set fits on one or two drives.
  • twitter
  • search & tagging

Log Processing

Use cases related to map-reduce like things.

  • massive sort
  • top N queries per day
  • compare data from two nonadjacent time periods

Enter labels to add to this page:
Please wait 
Looking for a label? Just start typing.

PLEASE POST QUESTIONS IN THE FORUMS: http://groups.google.com/group/mongodb-user. Post tips and clarifications here.

blog comments powered by Disqus