Added architecture information to README, updated FAQ

The README.rst now contains some more details regarding
the architecture, the interactions with other components
and how basically the freezer agent interact with the
scheduler and the API

Change-Id: I4be42cc4e229962a0c2811789c87a7ba8b32e2e5
This commit is contained in:
Fausto Marzi 2015-07-25 23:37:44 +01:00
parent 08d7066d5c
commit bc399eb0d1
9 changed files with 177 additions and 53 deletions

90
FAQ.rst
View File

@ -1,66 +1,78 @@
===
FAQ
===
1) What is freezer? Is a tool to automate data backup and restore
process using OpenStack Swift.
1) **What is freezer**?
Is a tool to automate data backup and restore
process using OpenStack Swift and/or other media storage
2) Does freezer support incremental backup? Yes. Incremental backup are
done using GNU tar incremental features
2) **Does freezer support incremental backup?**
Yes. Incremental backup are done using GNU tar incremental features
3) Does freezer check the file contents to establish if a file was
modified or not? No. Freezer check for changes at mtime and ctime in
3) **Does freezer check the file contents to establish if a file was modified?**
Freezer check for changes at mtime and ctime in
every file inode to evaluate if a file changed or not.
4) Why GNU tar rather then rsync? Both approaches are good. Rsync check
4) **Why GNU tar rather then rsync?**
Both approaches are good. Rsync check
the file content, while tar check the file inode. In our
environment, we need to backup directories with size > 300GB and
tens of thousands of files. Rsync approach is effective but slow.
environment, we need to backup directories with size > 400GB and
hundreds of thousands of files. Rsync approach is effective but slow.
Tar is fast as it needs to check only the file inodes, rather then
the full file content.
the full file content. Rsync backup type will be added pretty soon
5) Does feezer support encrypted backup? Yes. Freezer encrypt data
using OpenSSL (AES-256-CFB).
5) *Does freezer support encrypted backup?*
Yes. Freezer encrypt data using OpenSSL (AES-256-CFB).
6) Does freezer execute point-in-time backup? Yes. For point in time
backup LVM snapshot feature used.
6) **Does freezer execute point-in-time backup?**
Yes. For point in time backup LVM snapshot feature used.
7) Can I use freezer on OSX or other OS where GNU Tar is not installed
by default? Yes. For OSX and \*BSD, just install gtar and freezer
7) **Can I use freezer on OSX or other OS where GNU Tar is not installed
by default?**
Yes. For OSX and \*BSD, just install gtar and freezer
automatically will use gtar to execute backup. OS other then Linux,
OSX and \*BSD are currently not supported.
8) What Application backup does freezer support currently? MongoDB and
File system.
8) **What Application backup does freezer support currently?**
MongoDB, MySQL to have a higher level of data consistency, while
any appplication is supported for crash consistent data integrity.
9) How does the MongoDB backup happens? Freezer required journal
enabled in mongo and lvm volume to execute backup. It checks if the
mongo instance is the master and create lvm snapshot to have
9) **How does the MongoDB backup happens?**
Freezer required journal enabled in Mongo and lvm volume to execute backup.
It checks if the Mongo instance is the master and create lvm snapshot to have
consistent data.
10) Does freezer manage sparse files efficiently? Yes. Zeroed/null data
is not backed up. So less space and bandwidth will be used.
10) **Does freezer manage sparse files efficiently?**
Yes. Zeroed/null data is not backed up. So less space and bandwidth will be used.
11) Does freezer remove automatically backup after some time? Yes. From
command line the --remove-older-then option (days) can be used to
11) **Does freezer remove automatically backup after some time?**
Yes. From command line the option --remove-older-then (days) can be used to
remove objects older then (days).
12) Does freezer support MySQL Backup? MySQL and MariaDB support soon
will be included.
12) **Does freezer support MySQL Backup?**
Yes.
13) Is there any other storage media supported? Not currently. There's a
plan to add:
13) **What storage media are currently supported?**
Current support media storage are:
a. Swift
b. Store files on a remote host file system using ssh
c. Directory in the local host (i.e. NFS/CIFS mounted volumes)
- Amazon S3
- Store files on a remote host file system
- MongoDB object storage.
- Other directory in the local host (NFS mounted volumes)
14) **Does freezer has any Web UI or API?**
Yes. Freezer has REST API and a Web UI integrated with Horizon
14) Does freezer has any UI or API? Not currently. The UI in OpenStack
Horizon is being currently developed as REST API too.
15) **Does Freezer detect removed files between incremental executions?**
Yes.
15) Tar is not capable to detect deleted files between different backup
levels. Is freezer capable of doing that? Not currently. We are
writing a set of tar extensions in python to overcome tar
limitations like this and others.
16) **Will Freezer be included as official project in OpenStack?**
We hope so, as soon as we can.
17) **Does freezer support Windows?**
Yes. The freezer agent and scheduler can be executed on Windows
18) **What is being used on Windows to execute file system snapshots?**
Curretnly VSS are used to support point in time snapshots
19) **What applications are supported in Windows for consisten backups?**
SQL Server (--mode sqlserver)

View File

@ -1,31 +1,39 @@
=======
Freezer
=======
Freezer is a Python tool that helps you to automate the data backup and
Freezer is a Backup Restore DR as a Service platform that helps you to automate the data backup and
restore process.
The following features are available:
- Backup your filesystem using snapshot to swift
- Backup your filesystem using point in time snapshot
- Strong encryption supported: AES-256-CFB
- Backup your file system tree directly (without volume snapshot)
- Backup your journaled MongoDB directory tree using lvm snap to swift
- Backup MySQL DB with lvm snapshot
- Restore your data automatically from Swift to your file system
- Restore your data from a specific date automatically to your file system
- Low storage consumption as the backup are uploaded as a stream
- Flexible Incremental backup policy
- Data is archived in GNU Tar format
- Data compression with gzip
- Flexible backup policy (incremental and differential)
- Data is archived in GNU Tar format for file based incremental
- Multiple compression algorithm support (zlib, bzip2, xz)
- Remove old backup automatically according the provided parameters
- Multiple storage media support (Swift, local file system, ssh)
- Flush kernel buffered memory to disk
- Multi platform (Linux, Windows, \*BSD, OSX)
- Manage multiple jobs (i.e. multiple backups on the same node)
- Synchronize backups and restore on multiple nodes
- Web user interface integrated with OpenStack Horizon
- Can execute scripts/commands before or after a job execution
Requirements
============
- OpenStack Swift Account (Auth V2 used)
- OpenStack Swift Account (optional)
- python
- GNU Tar >= 1.26
- gzip
- gzip, bzip2, xz
- OpenSSL
- python-swiftclient
- python-keystoneclient
@ -46,7 +54,8 @@ Ubuntu / Debian
Swift client and Keystone client::
$ sudo apt-get install -y python-swiftclient python-keystoneclient
$ sudo apt-get install -y python-dev
$ sudo easy_install -U pip
MongoDB backup::
@ -101,10 +110,10 @@ it to Path:
Install sync binaries from https://technet.microsoft.com/en-us/sysinternals/bb897438 and add
it to Path
Swift client and Keystone client::
> pip install python-swiftclient
> pip install python-keystoneclient
General packages::
> easy_install -U pip
> pip install freezer
The basic Swift account configuration is needed to use freezer. Make sure python-swiftclient is installed::
@ -332,7 +341,8 @@ Local storage restore execution:
Architecture
============
Freezer architecture is simple. The components are:
Freezer architectural components are the following:
- OpenStack Swift (the storage)
- freezer client running on the node you want to execute the backups or
@ -341,6 +351,86 @@ Freezer architecture is simple. The components are:
Freezer use GNU Tar under the hood to execute incremental backup and
restore. When a key is provided, it uses OpenSSL to encrypt data
(AES-256-CFB)
=======
Freezer architecture is composed by the following components:
+-------------------+------------------------------------------------------------------------------------------------------------------------------------------------+
| Component | Description |
+===================+================================================================================================================================================+
| Freezer Web UI | Web interface that interact with the Freezer API to configure, change settings. |
| | It provides most of the feature from the freezerc CLI, advanced scheduler settings as multi-node backup synchronization, metrics and reporting.|
+-------------------+------------------------------------------------------------------------------------------------------------------------------------------------+
| Freezer Scheduler | A client side component, running on the node where the data backup has to be executed. |
| | It consist of a daemon that retrieve the data from the freezer API and execute jobs (i.e. backups, restore, admin actions, info actions, |
| | pre and/or post job scripts) by running the Freezer Agent. |
| | The metrics and exit codes returned by the freezer agent are captured and sent to the Freezer API. |
| | The scheduler manage the execution and synchronization of multiple jobs executed on a single or multiple nodes. |
| | The status of the execution of all the nodes is saved through the API. |
| | The Freezer scheduler take care of uploading jobs to the API by reading job files on the file system. |
| | It also have its own configuration file where job session or other settings like the freezer API polling interval can be configured. |
| | The Freezer scheduler manage jobs, for more information about jobs please refer to: freezer_api/README.rst under JOB the sections |
+-------------------+------------------------------------------------------------------------------------------------------------------------------------------------+
| Freezer Agent | Multiprocessing Python software that runs at client side, where the data backup has to be executed. |
| | It can be executed standalone or by the Freezer Scheduler. |
| | The freezerc provide flexible way to execute backup, restore and other actions on a running system. |
| | In order to provide flexibility in terms of data integrity, speed, performance, resources usage etc the freezer agent offer a |
| | wide range of options to execute optimized backup according the available resources as: |
| | |
| | - Segments size (the amount of memory used) |
| | - Queues size (optimize backups where I/O, bandwidth, memory or CPU is a constraints) |
| | - I/O Affinity and process priority (it can be used with real time I/O and maximum user level process priority) |
| | - Bandwidth limitation |
| | - Client side Encryption (AES-256-CFB) |
| | - Compression (multiple algorithms supported as zlib, bzip2, xz/lzma) |
| | - Parallel upload to pluggable storage media (i.e. upload backup to swift and to a remote node by ssh, |
| | or upload to two or more independent swift instances with different credentials, etc) |
| | - Execute file based incremental (like tar), block based incremental (like rsync algorithm) and differential based backup and restore |
| | - Multiplatform as it can be run on Linux, Windows, \*BSD and OSX |
| | - Automatic removal of old backups |
+-------------------+------------------------------------------------------------------------------------------------------------------------------------------------+
| Freezer API | API are used to store and provide metadata to the Freezer Web UI and to the Freezer Scheduler. |
| | Also the API are used to store session information for multi node backup synchronization. No workload data is stored in the API. |
| | For more information to the API please refer to: freezer_api/README.rst |
+-------------------+------------------------------------------------------------------------------------------------------------------------------------------------+
| DB Elasticsearch | Backend used by the API to store and retrieve metrics, metadata sessions information job status, etc. |
+-------------------+------------------------------------------------------------------------------------------------------------------------------------------------+
Freezer currently use GNU Tar under the hood to execute incremental backup and
restore. When a key is provided, it uses OpenSSL to encrypt data (AES-256-CFB)
The following diagrams can help to better understand the solution:
**Service Architecture**
.. image::
specs/Service_Architecture_02.png
**Freezer Agent backup work flow with API**
.. image::
specs/freezer_agent_backup_api.png
**Freezer Agent backup without API**
.. image::
specs/freezer_agent_backup.png
**Freezer Scheduler with API**
.. image:: specs/freezer_scheduler_api.png
**Freezer Job Session**
.. image:: specs/job_session.png
**Freezer Dashboard**
.. image:: specs/freezer_dashboard.png
**How to scale**
.. image:: specs/freezer_scheduler_api.png
>>>>>>> Added architecture information to README, updated FAQ
Low resources requirement
-------------------------
@ -389,6 +479,26 @@ following basic logic happens when Freezer execute:
important as the Manifest file contains the information of the
previous Freezer execution.
The following is what the Swift Manifest looks like::
{
'X-Object-Meta-Encrypt-Data': 'Yes',
'X-Object-Meta-Segments-Size-Bytes': '134217728',
'X-Object-Meta-Backup-Created-Timestamp': '1395734461',
'X-Object-Meta-Remove-Backup-Older-Than-Days': '',
'X-Object-Meta-Src-File-To-Backup': '/var/lib/snapshot-backup/mongod_dev-mongo-s1',
'X-Object-Meta-Maximum-Backup-level': '0',
'X-Object-Meta-Always-Backup-Level': '',
'X-Object-Manifest': u'socorro-backup-dev_segments/dev-mongo-s1-r1_mongod_dev-mongo-s1_1395734461_0',
'X-Object-Meta-Backup-Current-Level': '0',
'X-Object-Meta-Abs-File-Path': '',
'X-Object-Meta-Backup-Name': 'mongod_dev-mongo-s1',
'X-Object-Meta-Tar-Meta-Obj-Name': 'tar_metadata_dev-mongo-s1-r1_mongod_dev-mongo-s1_1395734461_0',
'X-Object-Meta-Hostname': 'dev-mongo-s1-r1',
'X-Object-Meta-Container-Segments': 'socorro-backup-dev_segments'
}
>>>>>>> Added architecture information to README, updated FAQ
3) The most relevant data taken in consideration for incremental are:
- 'X-Object-Meta-Maximum-Backup-level': '7'
@ -421,7 +531,6 @@ Through this meta data, we can identify the exact Manifest name of the
provided backup name. The syntax is:
container\_name/hostname\_backup\_name\_timestamp\_initiallevel
- 'X-Object-Meta-Backup-Current-Level': '0'
Record the current backup level. This is important as the value is
@ -488,6 +597,8 @@ To use standard cinder backups please provide --cindernative-vol-id argument.
Miscellanea
-----------
Please check the FAQ to: FAQ.rst
Available options::
$ freezerc
@ -679,3 +790,4 @@ Available options::
example of config file: instance = <db-instance>
--vssadmin VSSADMIN Create a backup using a snapshot on windows using
vssadmin. Options are: True and False, default is True

Binary file not shown.

After

Width:  |  Height:  |  Size: 49 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 22 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 33 KiB

BIN
specs/freezer_dashboard.png Normal file

Binary file not shown.

After

Width:  |  Height:  |  Size: 35 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 70 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 49 KiB

BIN
specs/job_session.png Normal file

Binary file not shown.

After

Width:  |  Height:  |  Size: 85 KiB