Working with Firebase Documents in ClojureScript
In a project I’m currently working on we’re making use of Google's Firebase to store domain data and run cloud functions.
In Firestore, which is Firebase’s database offering, every
document is essentially a Javascript object. While interop in
ClojureScript is pretty good we ended up converting the raw data
of these documents to ClojureScript data structures using
js->clj
. This also meant we’d need to convert them
back to JS objects before writing them to Firestore.
Because IDs are technically not part of the document the project adopted a pattern of representing documents as tuples:
[id (js->clj firestore-data)]
This works but isn’t particularly extensible. What if we also wanted to retain the “Firestore Reference” specifying a documents location inside the database? (Firestore stores data in a tree-like structure.)
It also leads to some funky gymnastics when working with collections of documents:
(sort-by (comp :join_dt second) list-of-document-tuples)
Could be worse... but also could be better.
This blogpost will compare various approaches approach to address
the problems above using
cljs-bean, basic
ClojureScript data structures, custom protocols and
:extend-via-metadata
.
cljs-bean
With the recent release of
cljs-bean we
have an interesting alternative to js->clj
.
Instead of eagerly walking the structure and converting all values
to their ClojureScript counterparts (i.e. persistent data
structures) the original object is wrapped in a thin layer that
allows us to use it as if it were a ClojureScript-native data
structure:
(require '[cljs-bean.core :as cljs-bean])
(-> (cljs-bean/bean #js {"some_data" 1, :b 2})
(get :some_data)) ; => 1
Given a Firestore QueryDocumentSnapshot we can make the JS object representing the data easily accessible from ClojureScript:
(-> (cljs-bean/->clj (.data query-document-snapshot))
(get :some_field))
;; (cljs-bean/->clj data) is roughly the same as
;; (cljs-bean/bean data :recursive true)
The bean is immutable and can be used in client side app-state as if it is one of ClojureScript’s persistent data structures.
Caveat: Updating a bean using
assoc
or similar will create a copy of the object
(Copy-on-Write). This is less performant and more GC intensive
than with persistent data structures. Given that the data is
usually quite small and that the document representations in our
app state mostly aren’t written to directly this is probably ok
(cljs-bean #72).
Whenever we want to use the raw object to update data in Firestore
we can simply call ->js
on the bean. Conveniently
this will fall back to clj->js
when called on
ClojureScript data structures.
(.set some-ref (cljs-bean/->js our-bean))
Arguably the differences to using plain
clj->js
aren’t monumental but working with a
database representing data as JS objects it is nice to retain
those original objects.
Integrating Firestore Metadata
Now we got beans. But they still don’t contain the document ID or reference. In most places we don’t care about a documents ID or reference. So how could we enable the code below while retaining ID and reference?
(sort-by :join_dt participants)
Let’s compare the various options we have.
Tuples and Nesting
I already described the tuple-based approach above. Another, similar, approach achieves the same by nesting the data in another map. Both fall short on the requirement to make document fields directly accessible.
;; structure
{:id "some-id", :ref "/events/some-id", :data document-data}
;; usage (including gymnastics)
(sort-by (comp :join_dt :data) participants)
I’m not too fond of either approach since they both expose a specific implementation detail, that the actual document data is nested, at the call site. In a way my critique of this approach is similar to why Eric Normand advocated for getters in his IN/Clojure ’19 talk — as far as I understand anyways.
Addition of a Special Key
Another approach could be to add metadata directly to the document data.
(defn doc [query-doc-snapshot]
(-> (cljs-bean/->clj (.data query-doc-snapshot))
(assoc ::meta {:id (.-id query-doc-snapshot
:ref (.-ref query-doc-snapshot})))
This is reasonable and makes document fields directly accessible. However it also requires us to separate document fields and metadata before passing the data to any function writing to Firestore.
;; before writing we need to remove ::meta
(.set some-ref (cljs-bean/->js (dissoc document-data ::meta))
I think this is a reasonable solution that improves upon some of the issues with the tuple and nesting approach. I realize that this isn’t a huge change but this inversion of how things are nested does give us that direct field access that the nesting approach did not.
Protocols and :extend-via-metadata
An approach I’ve found particularly interesting to play with makes
use of a protocol that can be implemented via metadata, as enabled
by the new :extend-via-metadata
option. This
capability was added in
Clojure 1.10
and subsequently added to ClojureScript with the
1.10.516 release:
(defprotocol IFirestoreDocument
:extend-via-metadata true
(id [_] "Return the ID (string) of this document")
(ref [_] "Return the Firestore Reference object"))
(defn doc [query-doc-snapshot]
(with-meta
(cljs-bean/->clj (.data query-doc-snapshot))
{`id (fn [_] (.-id query-doc-snapshot))
`ref (fn [_] (.-ref query-doc-snapshot))}))
Using with-meta
we extend a specific instance of a
bean to implement the IFirestoreDocument
protocol.
This allows direct access to document properties while retaining
important metadata:
(:name participant) ; => "Martin"
(firebase/id participant) ; => "some-firebase-id"
At call sites we use a well-defined API (defined by the protocol) instead of reaching into nested maps whose structure may need to change as our program evolves. This arguably could also be achieved with plain functions.
Sidenote: A previous iteration of this used
specify!
. Specify modifies the bean instance however,
meaning that whenever we’d update a bean the protocol
implementation got lost. In contrast metadata is carried over
across updates.
Summary
Using cljs-bean we’ve enabled idiomatic property access for JS data structures without walking the entire document and converting it to a persistent data structure. We also retain the original Javascript object making it easy to use for Firestore API calls.
We’ve compared different ways of attaching additional metadata to
those documents using compound structures as well as the new and
shiny :extend-via-metadata
. Using it we’ve extended
instances of beans to support a custom protocol allowing open
ended extension without hindering the ergonomics of direct
property access.
While I really enjoyed figuring out how to extend beans using
:extend-via-metadata
it turned out that any approach
storing data in “unusual places” (i.e. metadata) causes notable
complexity when also wanting to serialize the data.
Serializing metadata is something that has been added to Transit quite some time ago but compared to the plug and play serialization we get when working with plain maps it did not seem worth it. Even if set up properly the protocol implementations, which are functions, are impossible to serialize.
Ultimately we ended up with plain beans and storing metadata under a well known key that is removed before writing the data to Firestore again:
(defn doc [query-doc-snapshot]
(-> (cljs-bean/->clj (.data query-doc-snapshot))
(assoc ::meta {:id (.-id query-doc-snapshot)
:ref (.-ref query-doc-snapshot)})))
(defn id [doc]
(-> doc ::meta :id))
(defn ref [doc]
(-> doc ::meta :ref))
(defn data [doc]
(cljs-bean/->js (dissoc doc ::meta)))
If you're using Firebase or comparable systems, I'd be curious to hear if you do something similar on ClojureVerse.
Thanks to Matt Huebert and Mike Fikes for their feedback & ideas.