Exploring the U.S. National Bridge Inventory with Neo4j

mckenzma · September 1, 2020, 2:05pm

Hi eveyone,
I just posted my first blog in a series Exploring the U.S. National Bridge Inventory with Neo4j over on Medium. This first one is just a very brief intro to the why I wanted to do this. Each successive post will be about the next step in importing the data, building the model, running analysis, and making a GRANDstack application. I gave a lightning talk at NODES 2019 on this project. I have been making adjustments since then and will elaborate more of my process with each post.
Cheers,
Michael

mckenzma · September 6, 2020, 5:18pm

Just posted Part 2 here.

Much more to come.

Cheers!

tony.chiboucas · September 8, 2020, 10:45pm

Out of curiosity, why store all those file/row details at all, when you could go straight to the representation you really care about?

I'm thinking I'll show you what I mean after you post Part 3.

mckenzma · September 17, 2020, 8:31pm

Tony,
That's a fair question. I made the decision to store the raw data like this in order to do some data comparisons and to make the importation easier.

With regards to the data comparisons, when I first started working with this data I was looking at both the delimited and non-delimiter files. What I found initially (though not admittedly a thorough exploration) was that the files didn't match as one would expect. I wanted to do a deeper dive to understand what the differences were. I figured if the data looked different between 2 files that should technically be the same, then it is possible to the analysis could be different depending on which file you chose to use. I only have a basic query for that comparison. I may cover that in a later post.

As for the storing the raw data, this is something I picked up in my current job. My exploration of the data has only gone so far. That is one reason I wanted to do the series, have people take the journey with me. Since I do not know the final schema I need to be able to quickly build. And storing the raw data is just one solution for that.

mckenzma · November 24, 2020, 7:00pm

Part 3 is up if you are interested on Medium

mckenzma · May 17, 2021, 10:16pm

Been delayed on getting this out. Part 4 is up

Topic		Replies	Views
Sam from New York enters the room Introduce-Yourself	7	1147	April 23, 2019
Jesse Hemingway Introduce-Yourself	7	990	September 10, 2018
Hi from Idaho! ~Melissa Introduce-Yourself	2	321	May 5, 2020
Greetings from NC, USA Introduce-Yourself	2	709	September 11, 2019
Experimenting with a different storage format for Neo4j Neo4j Graph Platform performance	1	464	August 27, 2020

Get Certified in June!

Exploring the U.S. National Bridge Inventory with Neo4j

Related topics