Freigeben über


PDC 2008: Project Velocity Under the Hood

(Presenters: Anil Nori and Murali Krishnaprasad)

What they said:

Learn about the architecture of Velocity, Microsoft's main memory distributed caching framework. Hear how Velocity was built to meet the performance, scale, latency, and availability requirements of large scale enterprise and web applications. Learn about Velocity components and discuss design tradeoffs and mechanisms for in-memory storage, data placement, and data replication for performance, scale, and availability. Also, hear how Velocity provides database capabilities like LINQ support, indexing, concurrency control, and data consistency.

What I said:

Velocity is a caching engine for low latency, high scale, and high availability of data between many readers and writers. The cache can either be organized using replication of the data among all the cache servers or by partitioning so that different parts of the data are available at different cache servers. The full cache is augmented with a local cache to improve performance for frequently retrieved data. Entries are stored in the local cache using a deserialized format and the local cache uses update notifications to synchronize with the full cache.

A partitioned cache is divided into primary and secondary storage of items. One of the cache machines is elected to hold a global partition map that other cache machines share. Routing tables use the partition map to direct lookup requests for a range of hashed keys. Data can be retrieved from its primary storage or by examining a quorum of secondary storage locations. If the primary storage fails, the partition manager designates a new primary storage location from among the secondary storage locations. Data have a sequence identifier to track updates and to implement update subscriptions and notifications. Subscribers poll cache machines for interesting event notifications. When a cache machine is promoted to be the primary storage location, the latest sequence number is checked so that another machine may be promoted instead if the originally selected location is out of date.

Caches are embeddable to give better performance in some scenarios at the cost of load balancing and optimized locality of data. Embedding generally works better for replicated caches than partitioned caches. The cache uses either pessimistic locking for updates or optimistic updates using the sequence identifier to avoid locking by doing retries. To control eviction of items from the cache, policies include expiration of items, least recently used eviction of items, and memory pressure based culling.

Caching is an important optimization for enterprise and web scale service implementations. Velocity is a caching engine for low latency, high scale, and high availability of data between many readers and writers. The current caching engine is hostable to build replicated, partitioned, and embedded caches for applications. This talk focuses on the implementation details of updating and retrieving items from the cache. In the future, the caching engine may potentially be exposed as a cloud caching service or support cloud applications that require caching.