1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
|
Glossary
========
**Brick**
: A Brick is the basic unit of storage in GlusterFS, represented by an
export directory on a server in the trusted storage pool. A Brick is
represented by combining a server name with an export directory in the
following format:
`SERVER:EXPORT`
For example:
`myhostname:/exports/myexportdir/`
**Client**
: Any machine that mounts a GlusterFS volume.
**Cluster**
: A cluster is a group of linked computers, working together closely
thus in many respects forming a single computer.
**Distributed File System**
: A file system that allows multiple clients to concurrently access
data over a computer network.
**Extended Attributes**
: Extended file attributes (abbreviated xattr) is a file system feature
that enables users/programs to associate files/dirs with metadata.
**Filesystem**
: A method of storing and organizing computer files and their data.
Essentially, it organizes these files into a database for the
storage, organization, manipulation, and retrieval by the computer's
operating system.
Source: [Wikipedia][]
**FUSE**
: Filesystem in Userspace (FUSE) is a loadable kernel module for
Unix-like computer operating systems that lets non-privileged users
create their own file systems without editing kernel code. This is
achieved by running file system code in user space while the FUSE
module provides only a "bridge" to the actual kernel interfaces.
Source: [Wikipedia][1]
**Geo-Replication**
: Geo-replication provides a continuous, asynchronous, and incremental
replication service from site to another over Local Area Networks
(LAN), Wide Area Network (WAN), and across the Internet.
**GFID**
: Each file/directory on a GlusterFS volume has a unique 128-bit number
associated with it called the GFID. This is analogous to inode in a
regular filesystem.
**glusterd**
: The Gluster management daemon that needs to run on all servers in
the trusted storage pool.
**Infiniband**
InfiniBand is a switched fabric computer network communications link
used in high-performance computing and enterprise data centers.
**Metadata**
: Metadata is data providing information about one or more other
pieces of data.
**Namespace**
: Namespace is an abstract container or environment created to hold a
logical grouping of unique identifiers or symbols. Each Gluster
volume exposes a single namespace as a POSIX mount point that
contains every file in the cluster.
**Node**
: A server or computer that hosts one or more bricks.
**Open Source**
: Open source describes practices in production and development that
promote access to the end product's source materials. Some consider
open source a philosophy, others consider it a pragmatic
methodology.
Before the term open source became widely adopted, developers and
producers used a variety of phrases to describe the concept; open
source gained hold with the rise of the Internet, and the attendant
need for massive retooling of the computing source code.
Opening the source code enabled a self-enhancing diversity of
production models, communication paths, and interactive communities.
Subsequently, a new, three-word phrase "open source software" was
born to describe the environment that the new copyright, licensing,
domain, and consumer issues created.
Source: [Wikipedia][2]
**Petabyte**
: A petabyte (derived from the SI prefix peta- ) is a unit of
information equal to one quadrillion (short scale) bytes, or 1000
terabytes. The unit symbol for the petabyte is PB. The prefix peta-
(P) indicates a power of 1000:
1 PB = 1,000,000,000,000,000 B = 10005 B = 1015 B.
The term "pebibyte" (PiB), using a binary prefix, is used for the
corresponding power of 1024.
Source: [Wikipedia][3]
**POSIX**
: Portable Operating System Interface (for Unix) is the name of a
family of related standards specified by the IEEE to define the
application programming interface (API), along with shell and
utilities interfaces for software compatible with variants of the
Unix operating system. Gluster exports a fully POSIX compliant file
system.
**Quorum**
: The configuration of quorum in a trusted storage pool determines the
number of server failures that the trusted storage pool can sustain.
If an additional failure occurs, the trusted storage pool becomes
unavailable.
**Quota**
: Quotas allow you to set limits on usage of disk space by directories or
by volumes.
**RAID**
: Redundant Array of Inexpensive Disks (RAID) is a technology that
provides increased storage reliability through redundancy, combining
multiple low-cost, less-reliable disk drives components into a
logical unit where all drives in the array are interdependent.
**RDMA**
: Remote direct memory access (RDMA) is a direct memory access from the
memory of one computer into that of another without involving either
one's operating system. This permits high-throughput, low-latency
networking, which is especially useful in massively parallel computer
clusters.
**Rebalance**
: A process of fixing layout and resdistributing data in a volume when a
brick is added or removed.
**RRDNS**
: Round Robin Domain Name Service (RRDNS) is a method to distribute
load across application servers. RRDNS is implemented by creating
multiple A records with the same name and different IP addresses in
the zone file of a DNS server.
**Samba**
: Samba allows file and print sharing between computers running Windows and
computers running Linux. It is an implementation of several services and
protocols including SMB and CIFS.
**Self-Heal**
: The self-heal daemon that runs in the background, identifies
inconsistencies in files/dirs in a replicated volume and then resolves
or heals them. This healing process is usually required when one or more
bricks of a volume goes down and then comes up later.
**Split-brain**
: This is a situation where data on two or more bricks in a replicated
volume start to diverge in terms of content or metadata. In this state,
one cannot determine programitically which set of data is "right" and
which is "wrong".
**Translator**
: Translators (also called xlators) are stackable modules where each
module has a very specific purpose. Translators are stacked in a
hierarchical structure called as graph. A translator receives data
from its parent translator, performs necessary operations and then
passes the data down to its child translator in hierarchy.
**Trusted Storage Pool**
: A storage pool is a trusted network of storage servers. When you
start the first server, the storage pool consists of that server
alone.
**Userspace**
: Applications running in user space don’t directly interact with
hardware, instead using the kernel to moderate access. Userspace
applications are generally more portable than applications in kernel
space. Gluster is a user space application.
**Volfile**
: Volfile is a configuration file used by glusterfs process. Volfile
will be usually located at `/var/lib/glusterd/vols/VOLNAME`.
**Volume**
: A volume is a logical collection of bricks. Most of the gluster
management operations happen on the volume.
[Wikipedia]: http://en.wikipedia.org/wiki/Filesystem
[1]: http://en.wikipedia.org/wiki/Filesystem_in_Userspace
[2]: http://en.wikipedia.org/wiki/Open_source
[3]: http://en.wikipedia.org/wiki/Petabyte
|