Merge "[arch-design] Convert compute focused section"

This commit is contained in:
Jenkins 2015-11-24 15:49:03 +00:00 committed by Gerrit Code Review
commit 2ac87997c5
2 changed files with 127 additions and 0 deletions


@@ -0,0 +1,126 @@
=====================
Prescriptive examples
=====================

The Conseil Européen pour la Recherche Nucléaire (CERN), also known as
the European Organization for Nuclear Research, provides particle
accelerators and other infrastructure for high-energy physics research.
As of 2011, CERN operated these two compute centers in Europe, with
plans to add a third:

+-----------------------+------------------------+
| Data center           | Approximate capacity   |
+=======================+========================+
| Geneva, Switzerland   | - 3.5 megawatts        |
|                       |                        |
|                       | - 91000 cores          |
|                       |                        |
|                       | - 120 PB HDD           |
|                       |                        |
|                       | - 100 PB tape          |
|                       |                        |
|                       | - 310 TB memory        |
+-----------------------+------------------------+
| Budapest, Hungary     | - 2.5 megawatts        |
|                       |                        |
|                       | - 20000 cores          |
|                       |                        |
|                       | - 6 PB HDD             |
+-----------------------+------------------------+

To support a growing number of compute-heavy users of experiments
related to the Large Hadron Collider (LHC), CERN ultimately elected to
deploy an OpenStack cloud using Scientific Linux and RDO. This effort
aimed to simplify the management of the center's compute resources with
a view to doubling compute capacity through the addition of a data
center in 2013 while maintaining the same levels of compute staff.

The CERN solution uses :term:`cells <cell>` for segregation of compute
resources and for transparently scaling between different data centers.
This decision meant trading off support for security groups and live
migration. In addition, CERN must manually replicate some details, such
as flavors, across cells. In spite of these drawbacks, cells provide the
required scale while exposing a single public API endpoint to users.

CERN created a compute cell for each of the two original data centers
and created a third when it added a new data center in 2013. Each cell
contains three availability zones to further segregate compute resources
and at least three RabbitMQ message brokers configured for clustering
with mirrored queues for high availability.
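
A minimal sketch of how this cell layout might be expressed in
``nova.conf`` follows; the cell names, broker host names, and port
numbers are illustrative placeholders rather than CERN's actual values:

.. code-block:: ini

   # API (top-level) cell nova.conf -- illustrative values only
   [cells]
   enable = True
   name = api
   cell_type = api

   # Child (compute) cell nova.conf, one per compute cell
   [cells]
   enable = True
   name = cell-geneva-1
   cell_type = compute

   # RabbitMQ clustering with mirrored queues, configured per cell
   [DEFAULT]
   rabbit_hosts = rabbit01:5672,rabbit02:5672,rabbit03:5672
   rabbit_ha_queues = True
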
The API cell, which resides behind an HAProxy load balancer, is in the
data center in Switzerland and directs API calls to compute cells using
a customized variation of the cell scheduler. The customizations allow
certain workloads to route to a specific data center or to all data
centers, with cell RAM availability determining cell selection in the
latter case.
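
This routing behavior is the sort of thing controlled by the ``[cells]``
filter and weigher options in the API cell's ``nova.conf``. The sketch
below uses the stock target-cell filter and RAM-based weigher; the data
center routing filter is a hypothetical placeholder for CERN's custom
code, not an option that ships with OpenStack:

.. code-block:: ini

   # API cell nova.conf -- cell scheduler customization (sketch)
   [cells]
   # Honour per-request cell targeting hints, plus a hypothetical
   # CERN filter that routes workloads to a chosen data center.
   scheduler_filter_classes = nova.cells.filters.target_cell.TargetCellFilter,cern.cells.filters.DataCentreFilter
   # Weigh the remaining candidate cells by available RAM.
   scheduler_weight_classes = nova.cells.weights.ram_by_instance_type.RamByInstanceTypeWeigher
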
.. figure:: figures/Generic_CERN_Example.png

There is also some customization of the filter scheduler that handles
placement within the cells (a configuration sketch follows this list):

ImagePropertiesFilter
    Provides special handling depending on the guest operating system in
    use (Linux-based or Windows-based).

ProjectsToAggregateFilter
    Provides special handling depending on which project the instance is
    associated with.

default_schedule_zones
    Allows the selection of multiple default availability zones, rather
    than a single default.
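
As a rough illustration, these customizations surface as scheduler
options in the compute cells' ``nova.conf``. ``ProjectsToAggregateFilter``
and ``default_schedule_zones`` are CERN additions rather than stock nova
options, and the filter list and zone names below are placeholders:

.. code-block:: ini

   # Compute cell nova.conf -- filter scheduler customization (sketch)
   [DEFAULT]
   # ImagePropertiesFilter ships with nova; ProjectsToAggregateFilter
   # is CERN-specific code.
   scheduler_default_filters = RetryFilter,AvailabilityZoneFilter,RamFilter,ComputeFilter,ImagePropertiesFilter,ProjectsToAggregateFilter
   # Stock nova offers a single default_schedule_zone; the plural
   # option reflects CERN's customization.
   default_schedule_zones = zone-a,zone-b,zone-c
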
A central database team manages the MySQL database server in each cell
in an active/passive configuration with a NetApp storage back end.
Backups run every 6 hours.

Network architecture
~~~~~~~~~~~~~~~~~~~~

To integrate with existing networking infrastructure, CERN customized
legacy networking (nova-network) with a driver that integrates with
CERN's existing database for tracking MAC and IP address assignments.

The driver selects a MAC address and IP address for each new instance
from the pre-registered list associated in that database with the
compute node where the scheduler places the instance, and then updates
the database to reflect the assignment.
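
In a nova-network deployment, a customization of this kind is usually
wired in through the network manager option in ``nova.conf``. The class
path below is a hypothetical placeholder for CERN's driver, not its
actual name:

.. code-block:: ini

   # Compute node nova.conf -- legacy networking (sketch)
   [DEFAULT]
   network_api_class = nova.network.api.API
   # Hypothetical placeholder for CERN's custom driver, which looks up
   # pre-registered MAC and IP assignments in the site database.
   network_manager = cern_network.manager.DatabaseAssignedNetworkManager
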
Storage architecture
~~~~~~~~~~~~~~~~~~~~
CERN deploys the OpenStack Image service in the API cell and configures
it to expose version 1 (V1) of the API. This also requires the image
registry. The storage back end in use is a 3 PB Ceph cluster.
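
A rough sketch of the corresponding ``glance-api.conf`` settings is
shown below; the exact option layout varies between releases, and the
registry host, pool, and user names are placeholders:

.. code-block:: ini

   # glance-api.conf in the API cell -- sketch only
   [DEFAULT]
   enable_v1_api = True
   registry_host = glance-registry.example.org

   [glance_store]
   default_store = rbd
   rbd_store_pool = images
   rbd_store_user = glance
   rbd_store_ceph_conf = /etc/ceph/ceph.conf
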
CERN maintains a small set of Scientific Linux 5 and 6 images onto which
orchestration tools can place applications. Puppet manages instance
configuration and customization.

Monitoring
~~~~~~~~~~
CERN does not require direct billing, but uses the Telemetry service to
perform metering for the purposes of adjusting project quotas. CERN uses
a sharded, replicated MongoDB back end. To spread API load, CERN
deploys instances of the nova-api service within the child cells for
Telemetry to query against. This also requires the configuration of
supporting services such as keystone, glance-api, and glance-registry in
the child cells.
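
The Telemetry metering store described above maps to the ``[database]``
section of ``ceilometer.conf``; the credentials and host names in this
sketch are placeholders:

.. code-block:: ini

   # ceilometer.conf -- metering database connection (sketch)
   [database]
   # MongoDB back end; CERN additionally shards and replicates the data.
   connection = mongodb://ceilometer:secret@mongo01:27017,mongo02:27017,mongo03:27017/ceilometer?replicaSet=ceilometer
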
.. figure:: figures/Generic_CERN_Architecture.png

Additional monitoring tools in use include
`Flume <http://flume.apache.org/>`__,
`Elasticsearch <http://www.elasticsearch.org/>`__,
`Kibana <http://www.elasticsearch.org/overview/kibana/>`__, and the
CERN-developed `Lemon <http://lemon.web.cern.ch/lemon/index.shtml>`__
project.


@@ -8,6 +8,7 @@ Compute focused
compute-focus-technical-considerations.rst
compute-focus-operational-considerations.rst
compute-focus-architecture.rst
compute-focus-prescriptive-examples.rst
Compute-focused clouds are a specialized subset of the general
purpose OpenStack cloud architecture. A compute-focused cloud