class: center, middle # Building distributed applications: riak_core vs partisan Mariano Guerra [@warianoguerra](https://twitter.com/warianoguerra) [instadeq.com](https://instadeq.com) [@instadeq](https://twitter.com/instadeq) Erlang Meetup London 2018 --- # Questions * Who are we? * How do we share knowledge? * How do I communicate with someone? * Who is responsible for this? --- ``` Riak Core Partisan +--------------+ +------------+ +-----------------------------+ | | | | | | | RC Metadata | | RC Ring | | Bring Your Own | | | | | | | +--------------+ +------------+ +-----------------------------+ +--------------+ +------------+ +-----------------------------+ | | | | | | | Plumtree | | RC Gossip | | Plumtree | | | | | | | +--------------+ +------------+ +-----------------------------+ +-----------------------------+ +-----------------------------+ | | | | | Distributed Erlang | | Dist Erl/Static/HyParView | | | | | +-----------------------------+ +-----------------------------+ ``` --- ``` +--------------+ +------------+ | | | | | Metadata | | Routing | | | | | +--------------+ +------------+ +-----------------------------+ | | | Gossip | // Provides broadcast | | // communication +-----------------------------+ +-----------------------------+ | | | Membership | // Provides point to point | | // communication +-----------------------------+ ``` --- ``` Riak Core Partisan +--------------+ +------------+ +------------+ +--------------+ | | | | | | | | | RC Metadata | | RC Ring | | Metadata | | RC Ring | | | | | | | | | +--------------+ +------------+ +------------+ +--------------+ +-----------------------------+ +-----------------------------+ | | | | | Plumtree | | Plumtree | | | | | +-----------------------------+ +-----------------------------+ +-----------------------------+ +-----------------------------+ | | | | | Distributed Erlang | | DistErl/HyParView | | | | | +-----------------------------+ +-----------------------------+ ``` --- # Who are we? (Membership) ``` +-----------------------------+ | | | Membership | | | +-----------------------------+ ``` --- # Membership: Riak Core Distributed Erlang * Nodes loosely connected * Connect on first usage + spawn(Node,M,F,A) + net_adm:ping(Node) * Transitive connections + can be disabled with `-connect_all false` --- # Membership: Riak Core (cont.) * Disconnection removes a node + Node down + erlang:disconnect_node(Node) * Nodes connect all to all (mesh) --- # Membership: Riak Core (cont.) * Erlang Port Mapper (EPMD) started on each node + Maps symbolic node names to host/port * List visible nodes() + nodes(hidden) `-hidden` flag + nodes(connected) --- # Membership: Partisan Configurable * Default (Distributed Erlang) * HyParView * Static * Client/Server --- class: center, middle data:image/s3,"s3://crabby-images/14cee/14cee3262e1961e868954249afcba877c0b41e6c" alt="hyparview paper" --- # Membership: Partisan (HyParView) * Active View: size = log(n) + c * Passive View: k(log(n) + c) * Assumes reliable transport protocol like TCP For c = 1 and k = 6 * Network of 10.000 nodes * Active views size is 5 * Passive views size is 30 --- # Membership: Partisan (HyParView) * Active connection to every node on active view * On msg receive, forwards to nodes in active view * Tests connection on each broadcast * Removes from active view if connection fails * Promotes node from passive view * Periodic shuffle with random nodes + Shares some nodes from active view, all from passive view --- # How do I communicate with someone? --- # Riak Core ! or OTP behaviors --- # Partisan ```erlang partisan_peer_service:forward_message(NodeName, RemotePidOrRegisteredName, Message).``` --- # Who is responsible for this? ``` +--------------+ +------------+ | | | | | Metadata | | Routing | | | | | +--------------+ +------------+ ``` --- class: center, middle # Riak Core "Dynamo" data:image/s3,"s3://crabby-images/cc0c7/cc0c72e7e85f927d0fec07c00b72fdc89eac5f61" alt="dynamo paper" --- class: center, middle # Riak Core "Dynamo" data:image/s3,"s3://crabby-images/05cea/05cea4c8512ca77350d75f95d2c0d3e394b2a999" alt="dynamo ring" --- # Riak Core ```erlang DocIdx = riak_core_util:chash_key(Key), PrefList = riak_core_apl:get_primary_apl(DocIdx, 1, sdb), [{IndexNode, _Type}] = PrefList, riak_core_vnode_master:sync_spawn_command(IndexNode, Cmd, sdb_vnode_master). ``` --- # How do we share information? ``` +-----------------------------+ | | | Gossip | | | +-----------------------------+ ``` --- * Riak Core + Plumtree (Riak Core Metadata) * Partisan: + Plumtree --- # Riak Core Metadata * Namespaced Key-Value Store * Replicated to all nodes + Eventually + Asynchronously * In memory (ETS) * On disk (DETS) * Local Read/Writes --- class: center, middle data:image/s3,"s3://crabby-images/1712d/1712d12a93d450860b3ab23db66cddfc5e94e71d" alt="plumtree paper" --- # Plumtree Node State * Eager Set * Lazy Set * Seen Msgs --- class: center # Eager Push data:image/s3,"s3://crabby-images/c4f0b/c4f0bd13ec2332395a2b1599a3f5cd5afff14772" alt="plumtree-0" --- class: center # Eager Push data:image/s3,"s3://crabby-images/e0324/e0324cbfc8bc09c4b76b87ba41684648d0114b79" alt="plumtree-1" --- class: center # Eager Push data:image/s3,"s3://crabby-images/403d6/403d643e942d0dde836355954baa0ce09380fc1b" alt="plumtree-2" --- class: center # Eager Push data:image/s3,"s3://crabby-images/b2399/b2399c63dad13b3385dbc8862fa97c40b5fc9fa2" alt="plumtree-3" --- class: center # Eager Push data:image/s3,"s3://crabby-images/65f58/65f58d400bb812bb33d0eb2018852ac3ed8a6fb8" alt="plumtree-4" --- class: center # Eager Push data:image/s3,"s3://crabby-images/2b931/2b9318917fccb3fe2e04125db0230054401553c3" alt="plumtree-5" --- class: center # Eager Push data:image/s3,"s3://crabby-images/2b6ff/2b6ffe1d2a14950fca49c0c9d75052afd6a42903" alt="plumtree-6" --- class: center # Eager Push data:image/s3,"s3://crabby-images/70ac2/70ac2c3350c61f2bd7b7bdbbe41f135f37c0b611" alt="plumtree-7" --- class: center # Lazy Push data:image/s3,"s3://crabby-images/0859c/0859c18cc80f922caefb6fd8ee3470614212f93d" alt="plumtree-lazy-0" --- class: center # Lazy Push data:image/s3,"s3://crabby-images/6ccc3/6ccc34d544f3d90163050003a929071a3abb7023" alt="plumtree-lazy-1" --- class: center # Lazy Push data:image/s3,"s3://crabby-images/a3466/a346691a76910a2fb09b799b97bb58b9c6ff02ea" alt="plumtree-lazy-2" --- class: center # Lazy Push data:image/s3,"s3://crabby-images/5c15f/5c15f5f1addafce9bf00921f3aac88dd3b2c2766" alt="plumtree-lazy-3" --- class: center # Lazy Push data:image/s3,"s3://crabby-images/e86fd/e86fdff8b357fc05ca0681993483087e9e28faf9" alt="plumtree-lazy-4" --- class: center # Lazy Push data:image/s3,"s3://crabby-images/e757b/e757bc1b5970a26bd035c2b7f38116958930a490" alt="plumtree-lazy-5" --- class: center # Lazy Push data:image/s3,"s3://crabby-images/2cb2c/2cb2c87f4699a6073c955dbfd2f7ca3129b6c218" alt="plumtree-lazy-6" --- class: center # Lazy Push data:image/s3,"s3://crabby-images/81d46/81d46cb190bcf7afa72e03ae5145a6c6a192e305" alt="plumtree-lazy-7" --- # Active Anti Entropy (AAE) How does node 3 gets the values broadcasted while it was down? --- # Merkle Tree Tree in which every non-leaf node is labelled with the hash of the labels or values (in case of leaves) of its child nodes --- # Merkle Tree * Git * IPFS * ZFS, Btrfs * BitTorrent * Bitcoin * Cassandra, Riak --- class: center # Hash Tree data:image/s3,"s3://crabby-images/69638/69638cf35315c6f591ce56ffcc43d55712f86b3e" alt="hashtree" --- class: center # Segments data:image/s3,"s3://crabby-images/8b3df/8b3dfb2dfc27a14ca4eec5bc82ef3938eaf8a0dd" alt="hashtree-segments" --- class: center # Segment Hashes data:image/s3,"s3://crabby-images/44701/4470144178ed9b8100acc0dcf7f5f543270eec4a" alt="hashtree-segment-hashes" --- class: center # Upper Hashes data:image/s3,"s3://crabby-images/023d7/023d7a719995508ed27ca26d57b6b29293d928b6" alt="hashtree-upper-hashes" --- # Hash Tree Operations * Insert * Update * Compare --- class: center # Insert data:image/s3,"s3://crabby-images/eea5e/eea5e333d926aed21e1f8189473d6dcb648381b4" alt="hashtree-insert-0" --- class: center # Insert data:image/s3,"s3://crabby-images/47031/470315d4c4b2fe52d0fd3e10273b395961107fbb" alt="hashtree-insert-1" --- class: center # Insert data:image/s3,"s3://crabby-images/02635/02635cb8d7fea36ad660636f050d304620dd09bd" alt="hashtree-insert-2" --- class: center # Insert data:image/s3,"s3://crabby-images/1a033/1a03355eb152c6225a7efe552ebc82bc81372ef2" alt="hashtree-insert-3" --- class: center # Insert data:image/s3,"s3://crabby-images/0291b/0291be850634d5eb0cd077cff08ddfb1f5b41b56" alt="hashtree-insert-4" --- class: center # Update data:image/s3,"s3://crabby-images/1252c/1252cc02eda484cf4713258a60c125a3cdcf8ca8" alt="hashtree-insert-5" --- class: center # Update data:image/s3,"s3://crabby-images/92093/92093b0e809d8dcc274233ccf0b56fd8ac59a38d" alt="hashtree-insert-6" --- class: center # Update data:image/s3,"s3://crabby-images/96eac/96eacdbb4159405810aee141d361cfdf23564b11" alt="hashtree-insert-7" --- class: center # Update data:image/s3,"s3://crabby-images/ecf30/ecf3022e9ea90378ab6e722b34da3cef8a21cbc4" alt="hashtree-insert-8" --- class: center # Update data:image/s3,"s3://crabby-images/a6a2b/a6a2b22d4a69ca28dc490f211a67fcc0204987ad" alt="hashtree-insert-9" --- class: center # Update data:image/s3,"s3://crabby-images/b7dd0/b7dd05851d0246bb9e09777e55ad9808a0de6844" alt="hashtree-insert-10" --- class: center # Compare data:image/s3,"s3://crabby-images/e8ec0/e8ec0a4eb934716803d975b41447326c0927abf1" alt="hash-compare-1" --- class: center # Compare data:image/s3,"s3://crabby-images/af2c6/af2c63635f56e13fc88530f050028be23f063137" alt="hash-compare-2" --- class: center # Compare data:image/s3,"s3://crabby-images/51924/519247ea36af470b20ba375fb629dad6caa0aaa9" alt="hash-compare-3" --- class: center # Compare data:image/s3,"s3://crabby-images/f386b/f386b98af8b955da302e98e190d9184da83f8f3b" alt="hash-compare-4" --- class: center # Compare data:image/s3,"s3://crabby-images/1f097/1f097b469b745419993374134cba3b644a96d941" alt="hash-compare-5" --- class: center # Demo Time --- # Thanks [github.com/marianoguerra-atik/2018-london-erlang-meetup](https://github.com/marianoguerra-atik/2018-london-erlang-meetup) [github.com/marianoguerra-atik/riak_core_ring_on_partisans_plumtree](https://github.com/marianoguerra-atik/riak_core_ring_on_partisans_plumtree) [marianoguerra.github.io/presentations/2018-london-erlang-meetup](https://marianoguerra.github.io/presentations/2018-london-erlang-meetup) Mariano Guerra [@warianoguerra](https://twitter.com/warianoguerra) [instadeq.com](https://instadeq.com) [@instadeq](https://twitter.com/instadeq)