Redis

Redis is an open-source, networked, in-memory, key-value data store with optional durability, written in ANSI C.

Cache
If you have not already, you'll need to configure a Redis instance and install a Redis client library for PHP. Most environments require the phpredis PHP extension. On Debian / Ubuntu, you can install the requirements with the following command:

In your "LocalSettings.php" file, set:


 * Parameters explained
 * : An array of server names. A server name may be a hostname, a hostname/port combination or the absolute path of a UNIX socket. If a hostname is specified but no port, the standard port number 6379 will be used. Arrays keys can be used to specify the tag to hash on in place of the host/port. Required.
 * : The timeout for new connections, in seconds. Optional, default is 1 second.
 * : Set this to true to allow connections to persist across multiple web requests. False by default.
 * : The authentication password, will be sent to Redis in clear text. Optional, if it is unspecified, no AUTH command will be sent.
 * : If this is false, then each key will be mapped to a single server, and if that server is down, any requests for that key will fail. If this is true, a connection failure will cause the client to immediately try the next server in the list (as determined by a consistent hashing algorithm). True by default. This has the potential to create consistency issues if a server is slow enough to flap, for example if it is in swap death. True by default.

You will now be able to acquire a Redis object cache object via. If you'd like to use Redis as the default cache for various data, you may set any of the following configuration options:

Job queue

 * Parameters explained
 * : An array of parameters to RedisConnectionPool::__construct. Note that the serializer option is ignored as "none" is always used. If the same Redis server is used as for, the Redis password needs to be set here as well (see   config above).
 * : A hostname/port combination or the absolute path of a UNIX socket. If a hostname is specified but no port, the standard port number 6379 will be used. Required.
 * : The type of compression to use; one of (none,gzip).
 * : Currently it doesn't support setting it to false.

From that moment, jobs will be delivered to the Redis instance on the specified server.

Automatic handling of job recycling and abandons
Abandoned jobs aren't purged from redis, and failed and delayed jobs need to be rescheduled. This requires a special job runner service.

Clone the git repository https://github.com/wikimedia/mediawiki-services-jobrunner

Create a configuration file named :

Configure a daemon to run this at server start:

php redisJobChronService --config-file=config.json

The daemon itself supports running jobs from the queue, but that's not very well documented. See also Nad's docu on setting up the job queue (partially outdated).

MediaWiki & Wikimedia use cases for Redis

 * History of job queue runners at WMF on Wikitech.

General

 * Official site (see esp. Introduction to Redis)
 * The Redis article on the English Wikipedia.
 * Redis Watch - an e-mail round-up of Redis news, articles, tools and libraries
 * Redis/INCR
 * Getting to Know Redis
 * Redis, from the Ground Up
 * Redis and Relational Data
 * Redis Cookbook (book; not great, but see ch. "Analytics and Time-Based Data")
 * Interview with Salvatore Sanfilippo (code-oriented but still useful)
 * Redis DB (Google Group)

Analytics

 * Redis at Disqus (their entire analytics platform runs on Redis)
 * Effective Web App Analytics with Redis
 * How YouPorn uses Redis (video)
 * Realtime metrics using Redis bitmaps

Tooling

 * Redily fully-featured, cross-platform Redis GUI Client
 * Redsmin a real-time, atomic, performant administration and monitoring interface for Redis
 * redis-py is the library of choice for Python
 * Getting Started: Redis and Python
 * Redis and Python (presentation slides)
 * Resque for jobs
 * Redisco, a Python ORM for Redis
 * py-analytics (I haven't used this)
 * redis-bitops Ruby gem for sparse bitmap operations

Informed Opinions

 * Antirez: You Need To Think In Terms Of Organizing Your Data For Fetching

Miscellaneous

 * Storing hundreds of millions of simple key-value pairs (how Instagram uses Redis)
 * Key performance metrics to monitor for Redis