lfs_administration.md 9.71 KB
Newer Older
1
# GitLab Git LFS Administration
2 3 4 5 6 7

Documentation on how to use Git LFS are under [Managing large binary files with Git LFS doc](manage_large_binaries_with_git_lfs.md).

## Requirements

* Git LFS is supported in GitLab starting with version 8.2.
8
* Support for object storage, such as AWS S3, was introduced in 10.0.
9
* Users need to install [Git LFS client](https://git-lfs.github.com) version 1.0.1 and up.
10 11 12

## Configuration

13 14
Git LFS objects can be large in size. By default, they are stored on the server
GitLab is installed on.
15

16
There are various configuration options to help GitLab server administrators:
17 18 19

* Enabling/disabling Git LFS support
* Changing the location of LFS object storage
20
* Setting up object storage supported by [Fog](http://fog.io/about/provider_documentation.html)
21

22
### Configuration for Omnibus installations
23 24 25 26

In `/etc/gitlab/gitlab.rb`:

```ruby
27
# Change to true to enable lfs
28
gitlab_rails['lfs_enabled'] = false
Drew Blessing's avatar
Drew Blessing committed
29 30 31 32

# Optionally, change the storage path location. Defaults to
# `#{gitlab_rails['shared_path']}/lfs-objects`. Which evaluates to
# `/var/opt/gitlab/gitlab-rails/shared/lfs-objects` by default.
33 34 35
gitlab_rails['lfs_storage_path'] = "/mnt/storage/lfs-objects"
```

36
### Configuration for installations from source
37 38 39 40

In `config/gitlab.yml`:

```yaml
41
# Change to true to enable lfs
42 43 44 45 46
  lfs:
    enabled: false
    storage_path: /mnt/storage/lfs-objects
```

47
## Storing LFS objects in remote object storage
48

49 50
> [Introduced][ee-2760] in [GitLab Premium][eep] 10.0. Brought to GitLab Core
in 10.7.
51

52 53 54 55 56 57
It is possible to store LFS objects in remote object storage which allows you
to offload local hard disk R/W operations, and free up disk space significantly.
GitLab is tightly integrated with `Fog`, so you can refer to its [documentation](http://fog.io/about/provider_documentation.html)
to check which storage services can be integrated with GitLab.
You can also use external object storage in a private local network. For example,
[Minio](https://www.minio.io/) is a standalone object storage service, is easy to setup, and works well with GitLab instances.
58

59
GitLab provides two different options for the uploading mechanism: "Direct upload" and "Background upload".
60 61 62

**Option 1. Direct upload**

63 64 65
1. User pushes an lfs file to the GitLab instance
1. GitLab-workhorse uploads the file directly to the external object storage
1. GitLab-workhorse notifies GitLab-rails that the upload process is complete
66 67 68

**Option 2. Background upload**

69 70 71
1. User pushes an lfs file to the GitLab instance
1. GitLab-rails stores the file in the local file storage
1. GitLab-rails then uploads the file to the external object storage asynchronously
72

73
The following general settings are supported.
74 75 76 77 78

| Setting | Description | Default |
|---------|-------------|---------|
| `enabled` | Enable/disable object storage | `false` |
| `remote_directory` | The bucket name where LFS objects will be stored| |
79
| `direct_upload` | Set to true to enable direct upload of LFS without the need of local shared storage. Option may be removed once we decide to support only single storage for all files. | `false` |
80
| `background_upload` | Set to false to disable automatic upload. Option may be removed once upload is direct to S3 | `true` |
Micaël Bergeron's avatar
Micaël Bergeron committed
81
| `proxy_download` | Set to true to enable proxying all files served. Option allows to reduce egress traffic as this allows clients to download directly from remote storage instead of proxying all data | `false` |
82 83
| `connection` | Various connection options described below | |

84
The `connection` settings match those provided by [Fog](https://github.com/fog).
85

86
Here is a configuration example with S3.
87 88

| Setting | Description | example |
89
|---------|-------------|---------|
90 91 92
| `provider` | The provider name | AWS |
| `aws_access_key_id` | AWS credentials, or compatible | `ABC123DEF456` |
| `aws_secret_access_key` | AWS credentials, or compatible | `ABC123DEF456ABC123DEF456ABC123DEF456` |
93
| `aws_signature_version` | AWS signature version to use. 2 or 4 are valid options. Digital Ocean Spaces and other providers may need 2. | 4 |
94 95 96 97 98
| `region` | AWS region | us-east-1 |
| `host` | S3 compatible host for when not using AWS, e.g. `localhost` or `storage.example.com` | s3.amazonaws.com |
| `endpoint` | Can be used when configuring an S3 compatible service such as [Minio](https://www.minio.io), by entering a URL such as `http://127.0.0.1:9000` | (optional) |
| `path_style` | Set to true to use `host/bucket_name/object` style paths instead of `bucket_name.host/object`. Leave as false for AWS S3 | false |

99 100 101 102 103 104
Here is a configuration example with GCS.

| Setting | Description | example |
|---------|-------------|---------|
| `provider` | The provider name | `Google` |
| `google_project` | GCP project name | `gcp-project-12345` |
105 106
| `google_client_email` | The email address of the service account | `foo@gcp-project-12345.iam.gserviceaccount.com` |
| `google_json_key_location` | The json key path | `/path/to/gcp-project-12345-abcde.json` |
107

108
_NOTE: The service account must have permission to access the bucket. [See more](https://cloud.google.com/storage/docs/authentication)_
109 110 111

### Manual uploading to an object storage

112
There are two ways to manually do the same thing as automatic uploading (described above).
113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130

**Option 1: rake task**

```
$ rake gitlab:lfs:migrate
```

**Option 2: rails console**

```
$ sudo gitlab-rails console            # Login to rails console

> # Upload LFS files manually
> LfsObject.where(file_store: [nil, 1]).find_each do |lfs_object|
>   lfs_object.file.migrate!(ObjectStorage::Store::REMOTE) if lfs_object.file.file.exists?
> end
```

131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162
### S3 for Omnibus installations

On Omnibus installations, the settings are prefixed by `lfs_object_store_`:

1. Edit `/etc/gitlab/gitlab.rb` and add the following lines by replacing with
   the values you want:

	```ruby
	gitlab_rails['lfs_object_store_enabled'] = true
	gitlab_rails['lfs_object_store_remote_directory'] = "lfs-objects"
	gitlab_rails['lfs_object_store_connection'] = {
	  'provider' => 'AWS',
	  'region' => 'eu-central-1',
	  'aws_access_key_id' => '1ABCD2EFGHI34JKLM567N',
	  'aws_secret_access_key' => 'abcdefhijklmnopQRSTUVwxyz0123456789ABCDE',
	  # The below options configure an S3 compatible host instead of AWS
	  'host' => 'localhost',
	  'endpoint' => 'http://127.0.0.1:9000',
	  'path_style' => true
	}
	```

1. Save the file and [reconfigure GitLab]s for the changes to take effect.
1. Migrate any existing local LFS objects to the object storage:

    ```bash
    gitlab-rake gitlab:lfs:migrate
    ```

    This will migrate existing LFS objects to object storage. New LFS objects
    will be forwarded to object storage unless
    `gitlab_rails['lfs_object_store_background_upload']` is set to false.
163

164 165 166 167
### S3 for installations from source

For source installations the settings are nested under `lfs:` and then
`object_store:`:
168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191

1. Edit `/home/git/gitlab/config/gitlab.yml` and add or amend the following
   lines:

	```yaml
	lfs:
	enabled: true
	object_store:
	  enabled: false
	  remote_directory: lfs-objects # Bucket name
	  connection:
	    provider: AWS
	    aws_access_key_id: 1ABCD2EFGHI34JKLM567N
	    aws_secret_access_key: abcdefhijklmnopQRSTUVwxyz0123456789ABCDE
	    region: eu-central-1
	    # Use the following options to configure an AWS compatible host such as Minio
	    host: 'localhost'
	    endpoint: 'http://127.0.0.1:9000'
	    path_style: true
	```

1. Save the file and [restart GitLab][] for the changes to take effect.
1. Migrate any existing local LFS objects to the object storage:

192 193 194
    ```bash
    sudo -u git -H bundle exec rake gitlab:lfs:migrate RAILS_ENV=production
    ```
195

196 197 198
    This will migrate existing LFS objects to object storage. New LFS objects
    will be forwarded to object storage unless `background_upload` is set to
    false.
199

200 201 202
## Storage statistics

You can see the total storage used for LFS objects on groups and projects
203 204
in the administration area, as well as through the [groups](../../api/groups.md)
and [projects APIs](../../api/projects.md).
205

206 207
## Troubleshooting: `Google::Apis::TransmissionError: execution expired`

208 209 210
If LFS integration is configred with Google Cloud Storage and background uploads (`background_upload: true` and `direct_upload: false`),
sidekiq workers may encouter this error. This is because the uploading timed out with very large files.
LFS files up to 6Gb can be uploaded without any extra steps, otherwise you need to use the following workaround.
211 212 213 214

```shell
$ sudo gitlab-rails console            # Login to rails console

215 216
> # Set up timeouts. 20 minutes is enough to upload 30GB LFS files.
> # These settings are only in effect for the same session, i.e. they are not effective for sidekiq workers.
217 218 219 220 221 222 223 224 225 226
> ::Google::Apis::ClientOptions.default.open_timeout_sec = 1200
> ::Google::Apis::ClientOptions.default.read_timeout_sec = 1200
> ::Google::Apis::ClientOptions.default.send_timeout_sec = 1200

> # Upload LFS files manually. This process does not use sidekiq at all.
> LfsObject.where(file_store: [nil, 1]).find_each do |lfs_object|
>   lfs_object.file.migrate!(ObjectStorage::Store::REMOTE) if lfs_object.file.file.exists?
> end
```

227
See more information in [!19581](https://gitlab.com/gitlab-org/gitlab-ce/merge_requests/19581)
228

229 230
## Known limitations

231
* Support for removing unreferenced LFS objects was added in 8.14 onwards.
232 233
* LFS authentications via SSH was added with GitLab 8.12
* Only compatible with the GitLFS client versions 1.1.0 and up, or 1.0.2.
234 235
* The storage statistics currently count each LFS object multiple times for
  every project linking to it
236 237 238

[reconfigure gitlab]: ../../administration/restart_gitlab.md#omnibus-gitlab-reconfigure "How to reconfigure Omnibus GitLab"
[restart gitlab]: ../../administration/restart_gitlab.md#installations-from-source "How to restart GitLab"
239
[eep]: https://about.gitlab.com/pricing/ "GitLab Premium"
240
[ee-2760]: https://gitlab.com/gitlab-org/gitlab-ee/merge_requests/2760