Continue work on standardizing osprofiler docs

Add actual sections and files and remove the main index.rst just being a symlink of the README.rst file and have that README.rst file now be formatted like the other oslo projects. Change-Id: I7ec12eef59bfbc2434a9905fe6e1ee4b9e3736e5
2016-03-24 20:03:07 -07:00 · 2016-03-24 20:03:07 -07:00 · e391c61a3e
parent a007a00807
commit e391c61a3e
6 changed files with 399 additions and 341 deletions
--- a/README.rst
+++ b/README.rst
@ -1,340 +1,23 @@
-OSProfiler
-==========
-
-OSProfiler is an OpenStack cross-project profiling library.
-
-
-Background
----------
-
-OpenStack consists of multiple projects. Each project, in turn, is composed of
-multiple services. To process some request, e.g. to boot a virtual machine,
-OpenStack uses multiple services from different projects. In the case something
-works too slowly, it's extremely complicated to understand what exactly goes
-wrong and to locate the bottleneck.
-
-To resolve this issue, we introduce a tiny but powerful library,
-**osprofiler**, that is going to be used by all OpenStack projects and their
-python clients. To be able to generate 1 trace per request, that goes through
-all involved services, and builds a tree of calls.
-
-Why not cProfile and etc?
-------------------------
-
-**The scope of this library is quite different:**
-
-* We are interested in getting one trace of points from different service,
-  not tracing all python calls inside one process.
-
-* This library should be easy integratable in OpenStack. This means that:
-
-  * It shouldn't require too many changes in code bases of integrating
-    projects.
-
-  * We should be able to turn it off fully.
-
-  * We should be able to keep it turned on in lazy mode in production
-    (e.g. admin should be able to "trace" on request).
-
-
-OSprofiler API
--------------
-
-There are a couple of things that you should know about API before using it.
-
-
-* **4 ways to add a new trace point**
-
-    .. parsed-literal::
-
-        from osprofiler import profiler
-
-        def some_func():
-            profiler.start("point_name", {"any_key": "with_any_value"})
-            # your code
-            profiler.stop({"any_info_about_point": "in_this_dict"})
-
-
-        @profiler.trace("point_name",
-                        info={"any_info_about_point": "in_this_dict"},
-                        hide_args=False)
-        def some_func2(*args, **kwargs):
-            # If you need to hide args in profile info, put hide_args=True
-            pass
-
-        def some_func3():
-            with profiler.Trace("point_name",
-                                info={"any_key": "with_any_value"}):
-                # some code here
-
-        @profiler.trace_cls("point_name", info={}, hide_args=False,
-                            trace_private=False)
-        class TracedClass(object):
-
-            def traced_method(self):
-                pass
-
-            def _traced_only_if_trace_private_true(self):
-                 pass
-
-* **How profiler works?**
-
-  * **@profiler.Trace()** and **profiler.trace()** are just syntax sugar,
-    that just calls **profiler.start()** & **profiler.stop()** methods.
-
-  * Every call of **profiler.start()** & **profiler.stop()** sends to
-    **collector** 1 message. It means that every trace point creates 2 records
-    in the collector. *(more about collector & records later)*
-
-  * Nested trace points are supported. The sample below produces 2 trace points:
-
-      .. parsed-literal::
-
-          profiler.start("parent_point")
-          profiler.start("child_point")
-          profiler.stop()
-          profiler.stop()
-
-      The implementation is quite simple. Profiler has one stack that contains
-      ids of all trace points. E.g.:
-
-      .. parsed-literal::
-
-          profiler.start("parent_point") # trace_stack.push(<new_uuid>)
-                                         # send to collector -> trace_stack[-2:]
-
-          profiler.start("parent_point") # trace_stack.push(<new_uuid>)
-                                         # send to collector -> trace_stack[-2:]
-          profiler.stop()                # send to collector -> trace_stack[-2:]
-                                         # trace_stack.pop()
-
-          profiler.stop()                # send to collector -> trace_stack[-2:]
-                                         # trace_stack.pop()
-
-      It's simple to build a tree of nested trace points, having
-      **(parent_id, point_id)** of all trace points.
-
-* **Process of sending to collector**
-
-  Trace points contain 2 messages (start and stop). Messages like below are
-  sent to a collector:
-
-  .. parsed-literal::
-    {
-        "name": <point_name>-(start|stop)
-        "base_id": <uuid>,
-        "parent_id": <uuid>,
-        "trace_id": <uuid>,
-        "info": <dict>
-    }
-
-   * base_id - <uuid> that is equal for all trace points that belong
-               to one trace, this is done to simplify the process of retrieving
-               all trace points related to one trace from collector
-   * parent_id - <uuid> of parent trace point
-   * trace_id - <uuid> of current trace point
-   * info - the dictionary that contains user information passed when calling
-            profiler **start()** & **stop()** methods.
-
-
-
-* **Setting up the collector.**
-
-    The profiler doesn't include a trace point collector. The user/developer
-    should instead provide a method that sends messages to a collector. Let's
-    take a look at a trivial sample, where the collector is just a file:
-
-    .. parsed-literal::
-
-        import json
-
-        from osprofiler import notifier
-
-        def send_info_to_file_collector(info, context=None):
-            with open("traces", "a") as f:
-                f.write(json.dumps(info))
-
-        notifier.set(send_info_to_file_collector)
-
-    So now on every **profiler.start()** and **profiler.stop()** call we will
-    write info about the trace point to the end of the **traces** file.
-
-
-* **Initialization of profiler.**
-
-    If profiler is not initialized, all calls to **profiler.start()** and
-    **profiler.stop()** will be ignored.
-
-    Initialization is a quite simple procedure.
-
-    .. parsed-literal::
-
-        from osprofiler import profiler
-
-        profiler.init("SECRET_HMAC_KEY", base_id=<uuid>, parent_id=<uuid>)
-
-   ``SECRET_HMAC_KEY`` - will be discussed later, because it's related to the
-    integration of OSprofiler & OpenStack.
-
-    **base_id** and **trace_id** will be used to initialize stack_trace in
-    profiler, e.g. stack_trace = [base_id, trace_id].
-
-
-* **OSProfiler CLI.**
-
-  To make it easier for end users to work with profiler from CLI, osprofiler
-  has entry point that allows them to retrieve information about traces and
-  present it in human readable from.
-
-  Available commands:
-
-  * Help message with all available commands and their arguments:
-
-      .. parsed-literal::
-
-          $ osprofiler -h/--help
-
-  * OSProfiler version:
-
-      .. parsed-literal::
-
-          $ osprofiler -v/--version
-
-  * Results of profiling can be obtained in JSON (option: ``--json``) and HTML
-    (option: ``--html``) formats:
-
-      .. parsed-literal::
-
-          $ osprofiler trace show <trace_id> --json/--html
-
-      hint: option ``--out`` will redirect result of ``osprofiler trace show``
-      in specified file:
-
-      .. parsed-literal::
-
-          $ osprofiler trace show <trace_id> --json/--html --out /path/to/file
-
-Integration with OpenStack
--------------------------
-
-There are 4 topics related to integration OSprofiler & `OpenStack`_:
-
-* **What we should use as a centralized collector?**
-
-  We decided to use `Ceilometer`_, because:
-
-  * It's already integrated in OpenStack, so it's quite simple to send
-    notifications to it from all projects.
-
-  * There is an OpenStack API in Ceilometer that allows us to retrieve all
-    messages related to one trace. Take a look at
-    *osprofiler.parsers.ceilometer:get_notifications*
-
-
-* **How to setup profiler notifier?**
-
-  We decided to use olso.messaging Notifier API, because:
-
-  * `oslo.messaging`_ is integrated in all projects
-
-  * It's the simplest way to send notification to Ceilometer, take a
-    look at: *osprofiler.notifiers.messaging.Messaging:notify* method
-
-  * We don't need to add any new `CONF`_ options in projects
-
-
-* **How to initialize profiler, to get one trace across all services?**
-
-    To enable cross service profiling we actually need to do send from caller
-    to callee (base_id & trace_id). So callee will be able to init its profiler
-    with these values.
-
-    In case of OpenStack there are 2 kinds of interaction between 2 services:
-
-    * REST API
-
-        It's well known that there are python clients for every project,
-        that generate proper HTTP requests, and parse responses to objects.
-
-        These python clients are used in 2 cases:
-
-        * User access -> OpenStack
-
-        * Service from Project 1 would like to access Service from Project 2
-
-
-        So what we need is to:
-
-        * Put in python clients headers with trace info (if profiler is inited)
-
-        * Add `OSprofiler WSGI middleware`_ to your service, this initializes
-          the profiler, if and only if there are special trace headers, that
-          are signed by one of the HMAC keys from api-paste.ini (if multiple
-          keys exist the signing process will continue to use the key that was
-          accepted during validation).
-
-          * The common items that are used to configure the middleware are the
-            following (these can be provided when initializing the middleware
-            object or when setting up the api-paste.ini file)::
-
-                hmac_keys = KEY1, KEY2 (can be a single key as well)
-
-        Actually the algorithm is a bit more complex. The Python client will
-        also sign the trace info with a `HMAC`_ key (lets call that key ``A``)
-        passed to profiler.init, and on reception the WSGI middleware will
-        check that it's signed with *one of* the HMAC keys (the wsgi
-        server should have key ``A`` as well, but may also have keys ``B``
-        and ``C``) that are specified in api-paste.ini. This ensures that only
-        the user that knows the HMAC key ``A`` in api-paste.ini can init a
-        profiler properly and send trace info that will be actually
-        processed. This ensures that trace info that is sent in that
-        does **not** pass the HMAC validation will be discarded. **NOTE:** The
-        application of many possible *validation* keys makes it possible to
-        roll out a key upgrade in a non-impactful manner (by adding a key into
-        the list and rolling out that change and then removing the older key at
-        some time in the future).
-
-    * RPC API
-
-        RPC calls are used for interaction between services of one project.
-        It's well known that projects are using `oslo.messaging`_ to deal with
-        RPC. It's very good, because projects deal with RPC in similar way.
-
-        So there are 2 required changes:
-
-        * On callee side put in request context trace info (if profiler was
-          initialized)
-
-        * On caller side initialize profiler, if there is trace info in request
-          context.
-
-        * Trace all methods of callee API (can be done via profiler.trace_cls).
-
-
-* **What points should be tracked by default?**
-
-   I think that for all projects we should include by default 5 kinds of points:
-
-   * All HTTP calls - helps to get information about: what HTTP requests were
-     done, duration of calls (latency of service), information about projects
-     involved in request.
-
-   * All RPC calls - helps to understand duration of parts of request related
-     to different services in one project. This information is essential to
-     understand which service produce the bottleneck.
-
-   * All DB API calls - in some cases slow DB query can produce bottleneck. So
-     it's quite useful to track how much time request spend in DB layer.
-
-   * All driver calls - in case of nova, cinder and others we have vendor
-     drivers. Duration
-
-   * ALL SQL requests (turned off by default, because it produce a lot of
-     traffic)
-
-.. _CONF: http://docs.openstack.org/developer/oslo.config/
-.. _HMAC: http://en.wikipedia.org/wiki/Hash-based_message_authentication_code
-.. _OpenStack: http://openstack.org/
-.. _Ceilometer: https://wiki.openstack.org/wiki/Ceilometer
-.. _oslo.messaging: https://pypi.python.org/pypi/oslo.messaging
-.. _OSprofiler WSGI middleware: https://github.com/openstack/osprofiler/blob/master/osprofiler/web.py
+===========================================================
+ OSProfiler -- Library for cross-project profiling library
+===========================================================
+
+.. image:: https://img.shields.io/pypi/v/osprofiler.svg
+    :target: https://pypi.python.org/pypi/osprofiler/
+    :alt: Latest Version
+
+.. image:: https://img.shields.io/pypi/dm/osprofiler.svg
+    :target: https://pypi.python.org/pypi/osprofiler/
+    :alt: Downloads
+
+OSProfiler provides a tiny but powerful library that is used by
+most (soon to be all) OpenStack projects and their python clients. It
+provides functionality to be able to generate 1 trace per request, that goes
+through all involved services. This trace can then be extracted and used
+to build a tree of calls which can be quite handy for a variety of
+reasons (for example in isolating cross-project performance issues).
+
+* Free software: Apache license
+* Documentation: http://docs.openstack.org/developer/osprofiler
+* Source: http://git.openstack.org/cgit/openstack/osprofiler
+* Bugs: http://bugs.launchpad.net/osprofiler
--- a/doc/source/api.rst
+++ b/doc/source/api.rst
@ -0,0 +1,181 @@
+======
+ API
+======
+
+There are a few things that you should know about API before using it.
+
+Four ways to add a new trace point.
+-----------------------------------
+
+.. code-block:: python
+
+    from osprofiler import profiler
+
+    def some_func():
+        profiler.start("point_name", {"any_key": "with_any_value"})
+        # your code
+        profiler.stop({"any_info_about_point": "in_this_dict"})
+
+
+    @profiler.trace("point_name",
+                    info={"any_info_about_point": "in_this_dict"},
+                    hide_args=False)
+    def some_func2(*args, **kwargs):
+        # If you need to hide args in profile info, put hide_args=True
+        pass
+
+    def some_func3():
+        with profiler.Trace("point_name",
+                            info={"any_key": "with_any_value"}):
+            # some code here
+
+    @profiler.trace_cls("point_name", info={}, hide_args=False,
+                        trace_private=False)
+    class TracedClass(object):
+
+        def traced_method(self):
+            pass
+
+        def _traced_only_if_trace_private_true(self):
+             pass
+
+How profiler works?
+-------------------
+
+* **@profiler.Trace()** and **profiler.trace()** are just syntax sugar,
+  that just calls **profiler.start()** & **profiler.stop()** methods.
+
+* Every call of **profiler.start()** & **profiler.stop()** sends to
+  **collector** 1 message. It means that every trace point creates 2 records
+  in the collector. *(more about collector & records later)*
+
+* Nested trace points are supported. The sample below produces 2 trace points:
+
+    .. code-block:: python
+
+        profiler.start("parent_point")
+        profiler.start("child_point")
+        profiler.stop()
+        profiler.stop()
+
+    The implementation is quite simple. Profiler has one stack that contains
+    ids of all trace points. E.g.:
+
+    .. code-block:: python
+
+        profiler.start("parent_point") # trace_stack.push(<new_uuid>)
+                                       # send to collector -> trace_stack[-2:]
+
+        profiler.start("parent_point") # trace_stack.push(<new_uuid>)
+                                       # send to collector -> trace_stack[-2:]
+        profiler.stop()                # send to collector -> trace_stack[-2:]
+                                       # trace_stack.pop()
+
+        profiler.stop()                # send to collector -> trace_stack[-2:]
+                                       # trace_stack.pop()
+
+    It's simple to build a tree of nested trace points, having
+    **(parent_id, point_id)** of all trace points.
+
+Process of sending to collector.
+--------------------------------
+
+Trace points contain 2 messages (start and stop). Messages like below are
+sent to a collector:
+
+.. parsed-literal::
+
+  {
+      "name": <point_name>-(start|stop)
+      "base_id": <uuid>,
+      "parent_id": <uuid>,
+      "trace_id": <uuid>,
+      "info": <dict>
+  }
+
+The fields are defined as the following:
+
+* base_id - ``<uuid>`` that is equal for all trace points that belong
+  to one trace, this is done to simplify the process of retrieving
+  all trace points related to one trace from collector
+* parent_id - ``<uuid>`` of parent trace point
+* trace_id - ``<uuid>`` of current trace point
+* info - the dictionary that contains user information passed when calling
+  profiler **start()** & **stop()** methods.
+
+Setting up the collector.
+-------------------------
+
+The profiler doesn't include a trace point collector. The user/developer
+should instead provide a method that sends messages to a collector. Let's
+take a look at a trivial sample, where the collector is just a file:
+
+.. code-block:: python
+
+    import json
+
+    from osprofiler import notifier
+
+    def send_info_to_file_collector(info, context=None):
+        with open("traces", "a") as f:
+            f.write(json.dumps(info))
+
+    notifier.set(send_info_to_file_collector)
+
+So now on every **profiler.start()** and **profiler.stop()** call we will
+write info about the trace point to the end of the **traces** file.
+
+Initialization of profiler.
+---------------------------
+
+If profiler is not initialized, all calls to **profiler.start()** and
+**profiler.stop()** will be ignored.
+
+Initialization is a quite simple procedure.
+
+.. code-block:: python
+
+    from osprofiler import profiler
+
+    profiler.init("SECRET_HMAC_KEY", base_id=<uuid>, parent_id=<uuid>)
+
+``SECRET_HMAC_KEY`` - will be discussed later, because it's related to the
+integration of OSprofiler & OpenStack.
+
+**base_id** and **trace_id** will be used to initialize stack_trace in
+profiler, e.g. ``stack_trace = [base_id, trace_id]``.
+
+OSProfiler CLI.
+---------------
+
+To make it easier for end users to work with profiler from CLI, osprofiler
+has entry point that allows them to retrieve information about traces and
+present it in human readable from.
+
+Available commands:
+
+* Help message with all available commands and their arguments:
+
+    .. parsed-literal::
+
+        $ osprofiler -h/--help
+
+* OSProfiler version:
+
+    .. parsed-literal::
+
+        $ osprofiler -v/--version
+
+* Results of profiling can be obtained in JSON (option: ``--json``) and HTML
+  (option: ``--html``) formats:
+
+    .. parsed-literal::
+
+        $ osprofiler trace show <trace_id> --json/--html
+
+    hint: option ``--out`` will redirect result of ``osprofiler trace show``
+    in specified file:
+
+    .. parsed-literal::
+
+        $ osprofiler trace show <trace_id> --json/--html --out /path/to/file
--- a/doc/source/background.rst
+++ b/doc/source/background.rst
@ -0,0 +1,32 @@
+============
+ Background
+============
+
+OpenStack consists of multiple projects. Each project, in turn, is composed of
+multiple services. To process some request, e.g. to boot a virtual machine,
+OpenStack uses multiple services from different projects. In the case something
+works too slowly, it's extremely complicated to understand what exactly goes
+wrong and to locate the bottleneck.
+
+To resolve this issue, we introduce a tiny but powerful library,
+**osprofiler**, that is going to be used by all OpenStack projects and their
+python clients. To be able to generate 1 trace per request, that goes through
+all involved services, and builds a tree of calls.
+
+Why not cProfile and etc?
+-------------------------
+
+**The scope of this library is quite different:**
+
+* We are interested in getting one trace of points from different service,
+  not tracing all python calls inside one process.
+
+* This library should be easy integratable in OpenStack. This means that:
+
+  * It shouldn't require too many changes in code bases of integrating
+    projects.
+
+  * We should be able to turn it off fully.
+
+  * We should be able to keep it turned on in lazy mode in production
+    (e.g. admin should be able to "trace" on request).
--- a/doc/source/history.rst
+++ b/doc/source/history.rst
@ -0,0 +1 @@
+.. include:: ../../ChangeLog
--- a/doc/source/index.rst
+++ b/doc/source/index.rst
@ -1 +0,0 @@
-../../README.rst
--- a/doc/source/index.rst
+++ b/doc/source/index.rst
@ -0,0 +1,33 @@
+===========================================================
+ OSProfiler -- Library for cross-project profiling library
+===========================================================
+
+OSProfiler provides a tiny but powerful library that is used by
+most (soon to be all) OpenStack projects and their python clients. It
+provides functionality to be able to generate 1 trace per request, that goes
+through all involved services. This trace can then be extracted and used
+to build a tree of calls which can be quite handy for a variety of
+reasons (for example in isolating cross-project performance issues).
+
+.. toctree::
+   :maxdepth: 2
+
+   background
+   api
+   integration
+
+Release Notes
+=============
+
+.. toctree::
+   :maxdepth: 1
+
+   history
+
+Indices and tables
+==================
+
+* :ref:`genindex`
+* :ref:`modindex`
+* :ref:`search`
+
--- a/doc/source/integration.rst
+++ b/doc/source/integration.rst
@ -0,0 +1,129 @@
+=============
+ Integration
+=============
+
+There are 4 topics related to integration OSprofiler & `OpenStack`_:
+
+What we should use as a centralized collector?
+----------------------------------------------
+
+  We decided to use `Ceilometer`_, because:
+
+  * It's already integrated in OpenStack, so it's quite simple to send
+    notifications to it from all projects.
+
+  * There is an OpenStack API in Ceilometer that allows us to retrieve all
+    messages related to one trace. Take a look at
+    *osprofiler.parsers.ceilometer:get_notifications*
+
+
+How to setup profiler notifier?
+-------------------------------
+
+  We decided to use olso.messaging Notifier API, because:
+
+  * `oslo.messaging`_ is integrated in all projects
+
+  * It's the simplest way to send notification to Ceilometer, take a
+    look at: *osprofiler.notifiers.messaging.Messaging:notify* method
+
+  * We don't need to add any new `CONF`_ options in projects
+
+
+How to initialize profiler, to get one trace across all services?
+-----------------------------------------------------------------
+
+    To enable cross service profiling we actually need to do send from caller
+    to callee (base_id & trace_id). So callee will be able to init its profiler
+    with these values.
+
+    In case of OpenStack there are 2 kinds of interaction between 2 services:
+
+    * REST API
+
+        It's well known that there are python clients for every project,
+        that generate proper HTTP requests, and parse responses to objects.
+
+        These python clients are used in 2 cases:
+
+        * User access -> OpenStack
+
+        * Service from Project 1 would like to access Service from Project 2
+
+
+        So what we need is to:
+
+        * Put in python clients headers with trace info (if profiler is inited)
+
+        * Add `OSprofiler WSGI middleware`_ to your service, this initializes
+          the profiler, if and only if there are special trace headers, that
+          are signed by one of the HMAC keys from api-paste.ini (if multiple
+          keys exist the signing process will continue to use the key that was
+          accepted during validation).
+
+          * The common items that are used to configure the middleware are the
+            following (these can be provided when initializing the middleware
+            object or when setting up the api-paste.ini file)::
+
+                hmac_keys = KEY1, KEY2 (can be a single key as well)
+
+        Actually the algorithm is a bit more complex. The Python client will
+        also sign the trace info with a `HMAC`_ key (lets call that key ``A``)
+        passed to profiler.init, and on reception the WSGI middleware will
+        check that it's signed with *one of* the HMAC keys (the wsgi
+        server should have key ``A`` as well, but may also have keys ``B``
+        and ``C``) that are specified in api-paste.ini. This ensures that only
+        the user that knows the HMAC key ``A`` in api-paste.ini can init a
+        profiler properly and send trace info that will be actually
+        processed. This ensures that trace info that is sent in that
+        does **not** pass the HMAC validation will be discarded. **NOTE:** The
+        application of many possible *validation* keys makes it possible to
+        roll out a key upgrade in a non-impactful manner (by adding a key into
+        the list and rolling out that change and then removing the older key at
+        some time in the future).
+
+    * RPC API
+
+        RPC calls are used for interaction between services of one project.
+        It's well known that projects are using `oslo.messaging`_ to deal with
+        RPC. It's very good, because projects deal with RPC in similar way.
+
+        So there are 2 required changes:
+
+        * On callee side put in request context trace info (if profiler was
+          initialized)
+
+        * On caller side initialize profiler, if there is trace info in request
+          context.
+
+        * Trace all methods of callee API (can be done via profiler.trace_cls).
+
+
+What points should be tracked by default?
+-----------------------------------------
+
+   I think that for all projects we should include by default 5 kinds of points:
+
+   * All HTTP calls - helps to get information about: what HTTP requests were
+     done, duration of calls (latency of service), information about projects
+     involved in request.
+
+   * All RPC calls - helps to understand duration of parts of request related
+     to different services in one project. This information is essential to
+     understand which service produce the bottleneck.
+
+   * All DB API calls - in some cases slow DB query can produce bottleneck. So
+     it's quite useful to track how much time request spend in DB layer.
+
+   * All driver calls - in case of nova, cinder and others we have vendor
+     drivers. Duration
+
+   * ALL SQL requests (turned off by default, because it produce a lot of
+     traffic)
+
+.. _CONF: http://docs.openstack.org/developer/oslo.config/
+.. _HMAC: http://en.wikipedia.org/wiki/Hash-based_message_authentication_code
+.. _OpenStack: http://openstack.org/
+.. _Ceilometer: https://wiki.openstack.org/wiki/Ceilometer
+.. _oslo.messaging: https://pypi.python.org/pypi/oslo.messaging
+.. _OSprofiler WSGI middleware: https://github.com/openstack/osprofiler/blob/master/osprofiler/web.py