3 Mental Models of APIs

Viewing this by yourself? Hit 's' to see my speaker's notes.

API as...

Persistence layer ("REST"/GraphQL/Graph APIs generally)

Namespace of functions (RPC, ie gRPC)

State machine (REST)

Ok, so, what's the best model?

"Essentially, all models are wrong, but some are useful." — Box, George E. P.; Norman R. Draper (1987). Empirical Model-Building and Response Surfaces, p. 424, Wiley. ISBN 0471810339.

🤔

When building an API, what abstraction level should we expose to the client(s) for this piece of functionality?

Well, what does the client want? Or what do we want the client to want? Or will we have multiple clients, with competing "wants"?

Personal Opinion Alert

⚠

Let's put the "Application" back in Application Programming Interface.

Reality: your API is probably gonna do a bit of all 3.

If you're serving multiple, potentially thick, clients: you likely want some generic data exposure (like "REST" or GraphQL).
If you're providing unsequenced commands clients can choose how to present but you implement (and you can't just release as a local module): you need some RPC.
If you're providing sequenced workflows (like, y'know, business processes): you want REST with HATEOAS.

Ok dude. Why do you keep putting REST in scare quotes sometimes?

REST has a maturity model.

😒 So..?

So fully "mature" REST has to obey the hypermedia constraint.

⬇️ 🏭 resource state

✨✨ implicit program state ✨✨

application state 🎮 ⬆️

For that, I need to introduce a couple other terms. Resource state, and application state. Most people think of REST as focusing on resource state, and being resource-centric. (Ask audience: what is resource state?) Resource state is the server's state. The client will never have direct access to it. Instead they'll receive representations. So we tend to think of resources as 1:1 with objects, and REST as the CRUD we can do to those objects. In fact, originally the only restriction on what can be a resource is that it has a URL. So literally: can you dereference this? Then it's a resource. (Ask audience: what is application state?) Application state is what's on the client, and likewise, the server will never have direct access to it. In the simplest possible case, the application state on the client is the last representation of resource state a client received from the server, which encompasses likely some object's state and crucially the state transitions the server will accept to change state. Between these two states, your application is sitting on some state in the overall state machine. I want to encourage you to try thinking less in terms of resource state, and more in terms of application state, especially in state transitions.

Here's a pretty simple graph. I think the way most people look at RESTful APIs today, they see each node as a representation of persisted object state. The only links people concieve of are at the object relation level. So something like GraphQL extends and seems to supercede that view by giving you a flexible language to redefine objects and traverse an object graph. As a side note, I want to point out that's not really new. If you're curious, check out SPARQL. Anyway this object relation model isn't wrong. But I'm using this graph to represent the state machine of our application on the server. So the red is a path of execution through that program. You can think of the edges as simple links, but it's important to see them also as more advanced controls like forms. Imagine this is showing us a business process we provide. Every node is an application state and every edge between nodes is a state transition. Technically speaking, every node is a representation of a resource, and every edge is a link relation. Speaking less formally, every node is roughly a page, and every edge is an action to change the page. The content of any node may be made up of multiple different objects. This is possible because resources don't have to map to our object model 1:1. A resource is free to be a higher level abstraction. One key thing to observe about this state machine graph is that you can negotiate with a client about how much of it to send over at once. If you want totally dynamic binding, you can just give your client a single node at a time. But for clients that want fewer requests, you can let them request subgraphs or even the entire graph at a single request. That's called transclusion and we can chat about it a bit more if you're curious.

TL;DR: try state machines instead of entity-relations

etsy: data integrity vs user path

Navigation is based on category.

But what if a product lacks a category? 😬

(second fragment) Now, data integrity wise, a product doesn't have to have a category to be valid. But users can't easily reach any products without a category. If we think about the application state graph a user has to traverse, we can see immediately that products without categories are at a huge disadvantage--you can only see them if you're already browsing a store. You can't see that though if you're thinking about relationships between objects, because you're not thinking about them in the context that they will be used in, their semantics. Let's pretend another client for Etsy was written and that client didn't organize products by categories first, let's pretend instead they organized by vendor, because all products absolutely must have a vendor to be valid. This new user path is the dashed red line. I'd say that's a bad client design based on this graph, because the user has to traverse more edges to get to the buy state, and that's our real target state.

6

3

hypermedia process

write out all input/output between the client and server
circle inputs and outputs into nodes as appropriate
draw edges between every node where an interaction must happen
label every edge (input/output function) you just created

API specification is in conflict with hypermedia.


						application/vnd.siren+json

"name": "item",
"id": 22,
"actions":[
      {
         "name":"add-item",
         "title":"Add Item",
         "method":"POST",
         "href":"http://api.x.io/orders/42/items",
         "type":"application/x-www-form-urlencoded",
         "fields":[
            {
               "name":"orderNumber",
               "type":"hidden",
               "value":"42"
            },
            {
               "name":"productCode",
               "type":"text"
            },
            {
               "name":"quantity",
               "type":"number"
            }
         ]
      }
   ]

affordance

noun af·ford·ance \ə-ˈfȯr-dəns\
the qualities or properties of an object that define its possible uses or make clear how it can or should be used

"We sit or stand on a chair because those affordances are fairly obvious." — Scott Lafee, San Diego Union-Tribune, 15 Aug. 1993

affordance (hypermedia)

the qualities or properties of a representation that define its possible state changing transitions or make clear how it can or should be used to move to other program states

the simultaneous presentation of information and controls such that the information becomes the affordance through which the user (or automaton) obtains choices and selects actions — Roy T. Fielding

item → add-item → purchase: 2 clicks!

"name": "item",
"actions":[
      {
         "name":"add-item",
	}
]

"name": "add-item",
"actions":[
      {
         "name":"purchase",
	}
]

data with affordances > data without affordances

💺 > 🌲

(Humans prefer to sit on chairs.)

The child begins, no doubt, by perceiving the affordances of things for her, for her own personal behavior. She walks and sits and grasps relative to her own legs and body and hands. But she must learn to perceive the affordances of things for other observers as well as for herself. — James J. Gibson, The Ecological Approach to Visual Perception

alex moore - niemi