How to design schema denormalization to handle data changes?

Question

TuteeHUB · Accepted Answer

{"id":1075,"user_id":8,"qa_id":570,"answer_id":0,"answer":" 
I'm designing a schema for MongoDB, and keep running into scenarios where future updates might invalidate my cached copies of data. One example is for Users, Orders and Addresses.
const UserSchema = mongoose.Schema({
    addresses: [{ street: String, city: String, state: String, zip: String }]
});

const OrderSchema = mongoose.Schema({
    address: { street: String, city: String, state: String, zip: String }
});
This seems to be a standard approach, since MongoDB isn't meant to be a relational database, to denormalize the data where possible. However, the following scenario confuses me:

User adds an Address to their User document.
User places an Order and selects an Address from their list of addresses.
The address data is copied into the Order document when the order is persisted.
Before the order is shipped, the user discovers that they mistyped the address.
User changes that incorrect Address in their User document.
The Address in the Order object needs to change also, otherwise it will be shipped to an invalid address.

This seems to point towards the need for a reference using mongoose.Schema.Types.ObjectId to simulate a relational structure between the collections. (In that case, there would also be an Addresses collection, of course.) However, there are other considerations such as the history aspect of the denormalization. I want to store the actual address to which the order was shipped, even if that address is later deleted or changed. With denormalization this would seem to be easier than the relational paradigm.
One approach I considered to create an Addresses collection, then mark its records as invalidated when they are deleted, in case they are already referenced in an Order. And when they are modified, I would need to check the Orders collection to see if that Address is referenced. If it's already been referenced in a shipped order, I would have to leave it alone (for historical purposes), and create an additional Address document with the new changes. All of which sounds a bit complicated, compared to the denormalization approach.
The next part of the issue regards querying and reporting. If I want to pull up a list of all Users who have ever had an address in Illinois, I would need to traverse both the Addresses collection and the Orders in order to find out. Because they may have had an Illinois address, used it in a shipped order, then deleted it from the Addresses collection.
How do the smartest MongoDB data architects handle situations like this? I'm an experienced relational database architect, but am somewhat baffled by the conceptual framework of NoSQL. Thanks!","credit":0,"created_at":"2022-08-16T00:46:01.000000Z","updated_at":"2022-08-16T00:46:01.000000Z"}

Popular Categories

How to design schema denormalization to handle data changes?

Manpreet Singh

Answers (1)

manpreet Best Answer 2 years ago

Similar Forum

Which operating system you favour and why?

What are the most popular tech portals in India?

What are best technologies available today for education / aiding learning?

Explore Other Libraries

Online Exams

Question Bank

Career News

Feeds

Full Forms

Dictionary

Interview Question

Gigs

Quotes

Lyrics

Videos

Courses

Blogs

Tutorials

Forum

Educators

Corporates

Tools

Related Searches

Important General Tech Links

Join Our Community Today