introduction to docker (and a bit more) at lspe meetup sunnyvale

69
#lspe Introduction to Docker (and a bit more...) February 2014—updated for Docker 0.8.1

Upload: jerome-petazzoni

Post on 28-Jan-2015

115 views

Category:

Technology


3 download

DESCRIPTION

What's Docker, why does it matter, how does it use Linux Containers, why should you use it, and how? You'll find answers to those questions (and a bit more) in this presentation, given February 20th 2014 at the Large Scale Production Engineering Meet-Up at Yahoo, in Sunnyvale.

TRANSCRIPT

Page 1: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Introductionto

Docker

(and a bit more...)

February 2014—updated for Docker 0.8.1

Page 2: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Who?

●Jérôme Petazzoni — @jpetazzo–Not Maxime Petazzoni (author of Maestro NG)

–Not Thomas Petazzoni (embedded systems & kernel guru)

●Wrote dotCloud PAAS deployment tools–EC2, LXC, Puppet, Python, Shell, ØMQ...

●Docker contributor (CONTAINERIZE ALL THE THINGS!)

–Docker-in-Docker, VPN-in-Docker, router-in-Docker...●Runs Docker in production

–Psssst... You shouldn't do it, but here's how anyway!

Page 3: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Outline

● LXC and the container metaphor● Docker basics● Docker images● Docker deployment

Page 4: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Outline

● LXC and the container metaphor● Docker basics● Docker images● Docker deployment

Page 5: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

… Container ?

Page 6: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

High level approach:it's a lightweight VM

● own process space● own network interface● can run stuff as root● can have its own /sbin/init

(different from the host)

« Machine Container »

Page 7: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Low level approach:it's chroot on steroids

● can also not have its own /sbin/init● container = isolated process(es)● share kernel with host● no device emulation (neither HVM nor PV)

« Application Container »

Page 8: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

How does it work?Isolation with namespaces

● pid● mnt● net● uts● ipc● user

Page 9: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

pid namespace

jpetazzo@tarrasque:~$ ps aux | wc -l212

jpetazzo@tarrasque:~$ sudo docker run -t -i ubuntu bashroot@ea319b8ac416:/# ps auxUSER PID %CPU %MEM VSZ RSS TTY STAT START TIME COMMANDroot 1 0.0 0.0 18044 1956 ? S 02:54 0:00 bashroot 16 0.0 0.0 15276 1136 ? R+ 02:55 0:00 ps aux

(That's 2 processes)

Page 10: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

mnt namespace

jpetazzo@tarrasque:~$ wc -l /proc/mounts

32 /proc/mounts

root@ea319b8ac416:/# wc -l /proc/mounts

10 /proc/mounts

Page 11: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

net namespace

root@ea319b8ac416:/# ip addr

1: lo: <LOOPBACK,UP,LOWER_UP> mtu 65536 qdisc noqueue state UNKNOWN link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00 inet 127.0.0.1/8 scope host lo valid_lft forever preferred_lft forever inet6 ::1/128 scope host valid_lft forever preferred_lft forever

22: eth0: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc pfifo_fast state UP qlen 1000 link/ether 2a:d1:4b:7e:bf:b5 brd ff:ff:ff:ff:ff:ff inet 10.1.1.3/24 brd 10.1.1.255 scope global eth0 valid_lft forever preferred_lft forever inet6 fe80::28d1:4bff:fe7e:bfb5/64 scope link valid_lft forever preferred_lft forever

Page 12: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

uts namespace

jpetazzo@tarrasque:~$ hostnametarrasque

root@ea319b8ac416:/# hostnameea319b8ac416

Page 13: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

ipc namespace

jpetazzo@tarrasque:~$ ipcs------ Shared Memory Segments --------key shmid owner perms bytes nattch status0x00000000 3178496 jpetazzo 600 393216 2 dest 0x00000000 557057 jpetazzo 777 2778672 0 0x00000000 3211266 jpetazzo 600 393216 2 dest

root@ea319b8ac416:/# ipcs------ Shared Memory Segments --------key shmid owner perms bytes nattch status------ Semaphore Arrays --------key semid owner perms nsems ------ Message Queues --------key msqid owner perms used-bytes messages

Page 14: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

user namespace

● no « demo » for this one... Yet!● UID 0→1999 in container C1 is mapped to

UID 10000→11999 in host;UID 0→1999 in container C2 is mapped toUID 12000→13999 in host; etc.

● required lots of VFS and FS patches (esp. XFS)

● what will happen with copy-on-write?– double translation at VFS?

– single root UID on read-only FS?

Page 15: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

How does it work?Isolation with cgroups

● memory● cpu● blkio● devices

Page 16: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

memory cgroup

● keeps track pages used by each group:– file (read/write/mmap from block devices; swap)

– anonymous (stack, heap, anonymous mmap)

– active (recently accessed)

– inactive (candidate for eviction)

● each page is « charged » to a group● pages can be shared (e.g. if you use any COW FS)

● Individual (per-cgroup) limits and out-of-memory killer

Page 17: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

cpu and cpuset cgroups

● keep track of user/system CPU time● set relative weight per group● pin groups to specific CPU(s)

– Can be used to « reserve » CPUs for some apps

– This is also relevant for big NUMA systems

Page 18: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

blkio cgroups

● keep track IOs for each block device– read vs write; sync vs async

● set relative weights● set throttle (limits) for each block device

– read vs write; bytes/sec vs operations/sec

Note: earlier versions (<3.8) didn't account async correctly. 3.8 is better, but use 3.10 for best results.

Page 19: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

devices cgroups

● controls read/write/mknod permissions● typically:

– allow: /dev/{tty,zero,random,null}...

– deny: everything else

– maybe: /dev/net/tun, /dev/fuse, /dev/kvm, /dev/dri...

● fine-grained control for GPU, virtualization, etc.

Page 20: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

How does it work?Copy-on-write storage

● Create a new machine instantly(Instead of copying its whole filesystem)

● Storage keeps track of what has changed● Since 0.7, Docker has a storage plugin system

Page 21: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Storage:many options!

Union Filesystems

Snapshotting Filesystems

Copy-on-writeblock devices

Provisioning SuperfastSupercheap

FastCheap

FastCheap

Changingsmall files

SuperfastSupercheap

FastCheap

FastCostly

Changinglarge files

Slow (first time)Inefficient (copy-up!)

FastCheap

FastCheap

Diffing Superfast Superfast Slow

Memory usage Efficient Efficient Inefficient(at high densities)

Drawbacks Random quirksAUFS not mainline!AUFS more quirks

ZFS not mainlineBTRFS not as nice

Higher disk usageGreat performance (except diffing)

Bottom line Ideal for PAAS and high density things

This is the Future (probably)

Dodge Ram 3500

Page 22: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Compute efficiency:almost no overhead

● processes are isolated,but run straight on the host

● CPU performance = native performance

● memory performance = a few % shaved off for (optional) accounting

● network performance = small overhead; can be reduced to zero

Page 23: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Alright, I get this.Containers = nimble VMs.

Page 24: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Page 25: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

The container metaphor

Page 26: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Problem: shipping goods

? ? ? ? ? ?

? ? ? ? ? ?

? ? ? ? ? ?

? ? ? ? ? ?

? ? ? ? ? ?

? ? ? ? ? ?

Page 27: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Solution:the intermodal shipping container

Page 28: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Solved!

Page 29: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Problem: shipping code

? ? ? ? ? ?

? ? ? ? ? ?

? ? ? ? ? ?

? ? ? ? ? ?

? ? ? ? ? ?

? ? ? ? ? ?

Page 30: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Solution:the Linux container

Page 31: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Solved!

Page 32: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Separation of concerns:Dave the Developer

● inside my container:– my code

– my libraries

– my package manager

– my app

– my data

Page 33: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Separation of concerns:Oscar the Ops guy

● outside the container:– logging

– remote access

– network configuration

– monitoring

Page 34: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Outline

● LXC and the container metaphor● Docker basics● Docker images● Docker deployment

Page 35: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Page 36: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Docker-what?

● Open Source engine to commoditize LXC

STOP!

HAMMER DEMO TIME.

Page 37: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Page 38: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Yes, but...

● « I don't need Docker; I can do all that stuff with LXC tools, rsync,and some scripts! »

● correct on all accounts;but it's also true for apt, dpkg, rpm, yum, etc.

● the whole point is to commoditize,i.e. make it ridiculously easy to use

Page 39: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

What this really means…

● instead of writing « very small shell scripts » tomanage containers, write them to do the rest:– continuous deployment/integration/testing

– orchestration

● = use Docker as a building block● re-use other people images (yay ecosystem!)

Page 40: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Docker-what?The big picture

● Open Source engine to run containers● using copy-on-write for quick provisioning● allowing to create and share images● standard format for containers

(stack of layers; 1 layer = tarball+metadata)● standard, reproducible way to easily build

trusted images (Dockerfile, Stackbrew...)

Page 41: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Docker-what?Under the hood

● rewrite of dotCloud internal container engine– original version: Python, tied to dotCloud's internal stuff

– released version: Go, legacy-free

● the Docker daemon runs in the background– manages containers, images, and builds

– HTTP API (over UNIX or TCP socket)

– embedded CLI talking to the API

● Open Source (GitHub public repository + issue tracking)

Page 42: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Docker-what?The ecosystem

● Docker Inc. (formerly dotCloud Inc.)– ~30 employees, VC-backed

– SAAS and support offering around Docker

● Docker, the community– more than 300 contributors, 1500 forks on GitHub

– dozens of projects around/on top of Docker

– x100k trained developers

Page 43: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Outline

● LXC and the container metaphor● Docker basics● Docker images● Docker deployment

Page 44: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Authoring imageswith run/commit

Page 45: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

1) docker run ubuntu bash

2) apt-get install this and that

3) docker commit <containerid> <imagename>

4) docker run <imagename> bash

5) git clone git://.../mycode

6) pip install -r requirements.txt

7) docker commit <containerid> <imagename>

8) repeat steps 4-7 as necessary

9) docker tag <imagename> <user/image>

10) docker push <user/image>

Page 46: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Authoring imageswith run/commit

● Pros– Convenient, nothing to learn

– Can roll back/forward if needed

● Cons– Manual process

– Iterative changes stack up

– Full rebuilds are boring, error-prone

Page 47: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Authoring imageswith a Dockerfile

Page 48: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

FROM ubuntu

RUN apt-get -y updateRUN apt-get install -y g++RUN apt-get install -y erlang-dev erlang-manpages erlang-base-hipe ...RUN apt-get install -y libmozjs185-dev libicu-dev libtool ...RUN apt-get install -y make wget

RUN wget http://.../apache-couchdb-1.3.1.tar.gz | tar -C /tmp -zxf-RUN cd /tmp/apache-couchdb-* && ./configure && make install

RUN printf "[httpd]\nport = 8101\nbind_address = 0.0.0.0" > /usr/local/etc/couchdb/local.d/docker.ini

EXPOSE 8101CMD ["/usr/local/bin/couchdb"]

docker build -t jpetazzo/couchdb .

Page 49: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Authoring imageswith a Dockerfile

● Minimal learning curve● Rebuilds are easy● Caching system makes rebuilds faster● Single file to define the whole environment!

Page 50: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Do you evenChef?

Puppet?Ansible?

Salt?

Page 51: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Authoring Imageswith Chef/Puppet/Ansible/Salt/...

Plan A: « my other VM is a container »● write a Dockerfile to install $YOUR_CM● start tons of containers● run $YOUR_CM in them

Good if you want a mix of containers/VM/metal

But slower to deploy, and uses more resources

Page 52: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Authoring Imageswith Chef/Puppet/Ansible/Salt/...

Plan B: « the revolution will be containerized »● write a Dockerfile to install $YOUR_CM● … and run $YOUR_CM as part of build process● deploy fully baked images

Faster to deploy

Easier to rollback

Page 53: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Outline

● LXC and the container metaphor● Docker basics● Docker images● Docker deployment

Page 54: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Install Docker

● On your servers (Linux)– Packages (Ubuntu, Debian, Fedora, Gentoo, Arch...)

– Single binary install (Golang FTW!)

– Easy provisioning on Rackspace, Digital Ocean, EC2, GCE...

● On your dev env (Linux, OS X, Windows)– Vagrantfile

– boot2docker (25 MB VM image)

– Natively (if you run Linux)

Page 55: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Run containers

● SSH+CLI● REST API● Maestro NG● Mesos● OpenStack● … and more

Page 56: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Service discovery

● you can name your containers● you can link your containers

docker run -d -name frontdb mysqlimagedocker run -d -link frontdb:sql nginximage

→ environment vars are injected in web container

→ twelve-factors FTW!

Page 57: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Resource allocation

Page 58: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Allocating CPU and RAM

● docker run -c $CPU_SHARES -m $RAM_MB

Docker API will soon expose:● total CPU/RAM● allocated CPU/RAM● used CPU/RAM

WARNING: memory metrics are tricky!

Page 59: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Memory metrics

● freejpetazzo@tarrasque:~$ free total used free shared buffers cachedMem: 8076564 7019656 1056908 0 8 2634528-/+ buffers/cache: 4385120 3691444Swap: 0 0 0

● this is useless

Page 60: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Memory metrics

● cat /proc/meminfo

● GRRRREAT

MemTotal: 8076564 kBMemFree: 1055260 kBBuffers: 8 kBCached: 2634528 kBSwapCached: 0 kBActive: 3198088 kBInactive: 1407516 kBActive(anon): 1986692 kBInactive(anon): 185252 kBActive(file): 1211396 kBInactive(file): 1222264 kBUnevictable: 0 kBMlocked: 0 kBSwapTotal: 0 kBSwapFree: 0 kBDirty: 144 kBWriteback: 0 kBAnonPages: 1971208 kBMapped: 2402276 kBShmem: 200876 kBSlab: 148020 kBSReclaimable: 90680 kBSUnreclaim: 57340 kBKernelStack: 4936 kBPageTables: 43928 kBNFS_Unstable: 0 kBBounce: 0 kBWritebackTmp: 0 kBCommitLimit: 4038280 kBCommitted_AS: 8365088 kBVmallocTotal: 34359738367 kBVmallocUsed: 357696 kBVmallocChunk: 34359349628 kBHardwareCorrupted: 0 kBAnonHugePages: 0 kBHugePages_Total: 0HugePages_Free: 0HugePages_Rsvd: 0HugePages_Surp: 0Hugepagesize: 2048 kBDirectMap4k: 210384 kBDirectMap2M: 8056832 kB

Page 61: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Memory metrics

● cat/sys/fs/cgroup/memory/memory.stat

● slightly better...

cache 2700967936rss 1977851904rss_huge 0mapped_file 275435520swap 0pgpgin 104106567pgpgout 102964277pgfault 140459553pgmajfault 3858inactive_anon 168316928active_anon 1993658368inactive_file 1258921984active_file 1257791488unevictable 0hierarchical_memory_limit 9223372036854775807hierarchical_memsw_limit 9223372036854775807total_cache 2700967936total_rss 1977851904total_rss_huge 0total_mapped_file 275435520total_swap 0total_pgpgin 104106567total_pgpgout 102964277total_pgfault 140459553total_pgmajfault 3858total_inactive_anon 168316928total_active_anon 1993658368total_inactive_file 1258921984total_active_file 1257791488total_unevictable 0

Page 62: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Memory metrics

● cat/sys/fs/cgroup/memory/memory.stat

● this looks better

cache 2700967936rss 1977851904rss_huge 0mapped_file 275435520swap 0pgpgin 104106567pgpgout 102964277pgfault 140459553pgmajfault 3858inactive_anon 168316928active_anon 1993658368inactive_file 1258921984active_file 1257791488unevictable 0hierarchical_memory_limit 9223372036854775807hierarchical_memsw_limit 9223372036854775807total_cache 2700967936total_rss 1977851904total_rss_huge 0total_mapped_file 275435520total_swap 0total_pgpgin 104106567total_pgpgout 102964277total_pgfault 140459553total_pgmajfault 3858total_inactive_anon 168316928total_active_anon 1993658368total_inactive_file 1258921984total_active_file 1257791488total_unevictable 0

Page 63: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Memory metrics

● cat/sys/fs/cgroup/memory/memory.stat

● but you want this

cache 2700967936rss 1977851904rss_huge 0mapped_file 275435520swap 0pgpgin 104106567pgpgout 102964277pgfault 140459553pgmajfault 3858inactive_anon 168316928active_anon 1993658368inactive_file 1258921984active_file 1257791488unevictable 0hierarchical_memory_limit 9223372036854775807hierarchical_memsw_limit 9223372036854775807total_cache 2700967936total_rss 1977851904total_rss_huge 0total_mapped_file 275435520total_swap 0total_pgpgin 104106567total_pgpgout 102964277total_pgfault 140459553total_pgmajfault 3858total_inactive_anon 168316928total_active_anon 1993658368total_inactive_file 1258921984total_active_file 1257791488total_unevictable 0

Page 64: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Large Dosesof

Network Traffic

Page 65: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Networking

● sometimes, you need to expose range of ports● or completely arbitary ports● or non-IP protocols● or you have special needs:

– more than 1 Gb/s in containers– more than 1,000 connections/s

– more than 100,000 concurrent connections

Page 66: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Get rid of the overhead

● use openvswitch● bridge a container directly with a NIC

(remove iptables out of the equation)● move a (macvlan) NIC to a container

(a bit of overhead; multi-tenant)● move a (physical) NIC to a container

(zero overhead; single-tenant)● move a (virtual function) NIC to a container

(if your hardware supports it)

Page 67: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Page 68: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Pipework

● sidekick script for Docker● I replaced the plumber with

a very small small shell script

pipework br1 $APACHE 192.168.1.1/24

pipework br1 $MYSQL 192.168.1.2/24

pipework br1 $CONTAINERID 192.168.4.25/[email protected]

pipework eth2 $(docker run -d hipache) 50.19.169.157

pipework eth3 $(docker run -d hipache) 107.22.140.5

pipework br0 $(docker run -d zmqworker) dhcp fa:de:b0:99:52:1c

Page 69: Introduction to Docker (and a bit more) at LSPE meetup Sunnyvale

#lspe

Thank you! Questions?

http://docker.io/

http://docker.com/

@docker

@jpetazzo