Replaying edits

Once we have extracted the edits that have happened to a neuron (see here), it can be helpful to replay them in order to see how the neuron has changed over time.

Extract the edits and initial state of this neuron¶

import networkx as nx
from tqdm.auto import tqdm

from caveclient import CAVEclient
from paleo import get_initial_graph, get_root_level2_edits

As in the previous example, we'll start by extracting the edits to a neuron.

root_id = 864691135639556411

client = CAVEclient("minnie65_public", version=1078)

networkdeltas = get_root_level2_edits(root_id, client)

Extracting level2 edits:   0%|          | 0/693 [00:00<?, ?it/s]

This time, we'll also use paleo.get_initial_graph to get the level2 graph connectivity for all objects that participate in this neuron's edit history. This will allow us to replay the edits in the context of the full segmentation graph.

initial_graph = get_initial_graph(root_id, client)

/Users/ben.pedigo/code/cave-edits/paleo/.venv/lib/python3.12/site-packages/networkx/readwrite/json_graph/node_link.py:287: FutureWarning: 
The default value will be changed to `edges="edges" in NetworkX 3.6.

To make this warning go away, explicitly set the edges kwarg, e.g.:

  nx.node_link_graph(data, edges="links") to preserve current behavior, or
  nx.node_link_graph(data, edges="edges") for forward compatibility.
  warnings.warn(

Getting initial graph:   0%|          | 0/309 [00:00<?, ?it/s]

Replaying the edits over the level2 graph¶

The simplest thing we can do now is to replay the edits in order. paleo provides the apply_edit function that takes in the graph and an edit and applies it to the graph. Note that this modifies the graph in place.

from paleo import apply_edit

deltas = list(networkdeltas.values())

graph = initial_graph.copy()
for delta in tqdm(deltas, disable=False):
    apply_edit(graph, delta)

  0%|          | 0/693 [00:00<?, ?it/s]

As a sanity check, we might want to compare the graph that we got from replaying edits from the original, to the actual graph that we'd get from client.chunkedgraph.level2_chunk_graph.

To do so, we need to also know a point on the object of interest to use as an anchor point - this is because typically graph will be composed of many connected components, but only one of them corresponds to the current state of our neuron.

from paleo import get_nucleus_supervoxel

nuc_supervoxel_id = get_nucleus_supervoxel(root_id, client)

nuc_level2_id = client.chunkedgraph.get_roots(nuc_supervoxel_id, stop_layer=2)[0]

neuron_component = nx.node_connected_component(graph, nuc_level2_id)
neuron_graph = graph.subgraph(neuron_component)

computed_edgelist = nx.to_pandas_edgelist(neuron_graph).values.astype(int)

final_edgelist = client.chunkedgraph.level2_chunk_graph(root_id)

It's assuring to see that we at least have the same number of edges in both cases.

len(final_edgelist), len(computed_edgelist)

(10210, 10210)

...and when we compare the actual edgelists element-wise, we see that they are the same.

import numpy as np

final_edgelist = np.unique(np.sort(final_edgelist, axis=1), axis=0)
computed_edgelist = np.unique(np.sort(computed_edgelist, axis=1), axis=0)

(final_edgelist == computed_edgelist).all()

True

Tracking neuron state over the edit history¶

Now, let's try keeping track of the state of the neuron at every point along this edit history.

This becomes just a bit more complicated: often the level2 ID corresponding to a nucleus's location may change over time if there was an edit near that location. If we want to keep track of the segmentation component corresponding to the nucleus (or some other point) over this whole history, then we need to know how this ID changes over time. paleo provides the get_node_aliases function to help with this.

from paleo import get_node_aliases

node_info = get_node_aliases(nuc_supervoxel_id, client, stop_layer=2)
node_info

	start_valid_ts	end_valid_ts
node_id
161513998385152439	2020-08-01 13:07:22.739000+00:00	2024-06-05 10:10:01.203215+00:00
161513998385152001	2020-05-29 13:26:43.761000+00:00	2020-08-01 13:07:22.738999+00:00

Now we have all the ingredients to replay the edits and keep track of the neuron's state.

def find_level2_node(graph, level2_ids):
    for level2_id in level2_ids:
        if graph.has_node(level2_id):
            return level2_id
    return None


# keep track of components that are reached as we go
components = []

# store the initial state
nucleus_node_id = find_level2_node(graph, node_info.index)
component = nx.node_connected_component(graph, nucleus_node_id)
components.append(component)

# after each edit, apply it and store the connected component for the nucleus node
for delta in tqdm(deltas, disable=False):
    apply_edit(graph, delta)
    nucleus_node_id = find_level2_node(graph, node_info.index)
    component = nx.node_connected_component(graph, nucleus_node_id)
    components.append(component)

  0%|          | 0/693 [00:00<?, ?it/s]

from paleo import get_component_masks

l2_masks = get_component_masks(components)
l2_masks

	0	1	2	3	4	5	6	7	8	9	...	684	685	686	687	688	689	690	691	692	693
150388177863966928	True	True	True	True	True	True	True	True	True	True	...	True	True	True	True	True	True	True	True	True	True
150458546608144530	True	True	True	True	True	True	True	True	True	True	...	True	True	True	True	True	True	True	True	True	True
150528846632845407	True	True	True	True	True	True	True	True	True	True	...	True	True	True	True	True	True	True	True	True	True
150528846632845424	True	True	True	True	True	True	True	True	True	True	...	True	True	True	True	True	True	True	True	True	True
150528915352323074	True	True	True	True	True	True	True	True	True	True	...	True	True	True	True	True	True	True	True	True	True
...	...	...	...	...	...	...	...	...	...	...	...	...	...	...	...	...	...	...	...	...	...
181993776573580123	True	True	True	True	True	True	True	True	True	True	...	True	True	True	True	True	True	True	True	True	True
181993845293056671	True	True	True	True	True	True	True	True	True	True	...	True	True	True	True	True	True	True	True	True	True
181993845293057025	True	True	True	True	True	True	True	True	True	True	...	True	True	True	True	True	True	True	True	True	True
182064145317757549	True	True	True	True	True	True	True	True	True	True	...	True	True	True	True	True	True	True	True	True	True
182064214037234457	True	True	True	True	True	True	True	True	True	True	...	True	True	True	True	True	True	True	True	True	True

15544 rows × 694 columns

Simplifying the process¶

The resolve_edit function simplifies some of this boilerplate code by taking in the graph, the edit, and a list of nodes to check to "anchor" the edit. In our case, this was the level2 IDs corresponding to the nucleus point. It also simplifies the code if we add an element to our deltas dictionary mapping -1 to None, which denotes the original state of the neuron before any edits were applied.

from paleo import resolve_edit

# keep track of components that are reached as we go
components = []
# remember to include the initial state
networkdeltas = {-1: None, **networkdeltas}

# after each edit, apply it and store the connected component for the nucleus node
for edit_id, delta in tqdm(networkdeltas.items(), disable=False):
    component = resolve_edit(graph, delta, node_info.index)
    components.append(component)

  0%|          | 0/694 [00:00<?, ?it/s]

The above syntax is helpful if you want to have some control over what happens at each stage of the process, or if you want to keep track of particular information at each stage. If you just want the level2 nodes or level2 graph at each stage, you can use the apply_edit_sequence function, which is a wrapper around this resolve_edit loop.

This method returns a dictionary mapping the edit ID to the state of the neuron after applying that edit. By default, this function will include the level2 nodes at each state of the neuron's history.

from paleo import apply_edit_sequence

nodes_by_state = apply_edit_sequence(graph, networkdeltas, node_info.index)

len(nodes_by_state[9028])

  0%|          | 0/694 [00:00<?, ?it/s]

If you need to keep the actual connectivity of the level2 graph at each stage, then instead pass in return_graph=True. This will return a dictionary mapping the edit ID to the level2 graph at that stage. This version is a bit slower since it makes a copy of the graph at each edit.

from paleo import apply_edit_sequence

graphs_by_state = apply_edit_sequence(
    graph, networkdeltas, node_info.index, return_graphs=True
)

graphs_by_state[9028]

  0%|          | 0/694 [00:00<?, ?it/s]

<networkx.classes.graph.Graph at 0x3b5cc6ae0>