Simple solution, huge savings

by Dongeun Paeng, Senior Partner

1. The origin of the issue

In any rapidly growing company, data accumulates quickly. Industries that don't necessarily need a dedicated development team (like manufacturing or distribution) often use off-the-shelf databases. As a result, the performance of their dashboards deteriorates, and they find themselves unable to perform cohort or statistical analysis. However, this is not necessarily a bad sign; it means the business has scaled to a point where data management becomes a critical issue.

The client who approached Da Vinci Technologies was no exception. They had accrued over 2.2 million purchase records, which their data analysts were downloading in chunks to Excel for analysis. After hearing the client's complaints of sluggish performance and difficulties in finding errors in their data, we took a closer look at their database. They were storing all purchase data in a single large table, resulting in query times exceeding 70 seconds for simple search tasks.

By tuning simple queries, we were able to reduce query times by over 10 seconds. (After the entire project, the same query time went down to 0.175s.)

After witnessing tangible improvements, the client hired Da Vinci Technologies.

2. Database Normilisation

What Da Vinci Technologies did can be summarized in two words: "Database Normalisation."

We launched a separate database instance for analytics and created about ten tables tailored for analytic purposes. Remember this when creating a well-normalized database for analytics - "You shouldn't need to use 'join' statements when analyzing."

Following the database redesign, we performed an ETL (Extract, Transform, Load) process to move relevant data from the old database.

Cohort analysis, which was crucial for the client, is a complex task that requires thoughtful database design. The output for cohort analysis grows horizontally (M1, M2, M3,...), but databases should be designed to grow vertically.

If you are a team lacking in development resources and want a blazingly fast database for analytics, you'll need to: (1) Write SQL scripts (2) Launch an analytics server (3) Create cronjobs (4) Integrate with dashboard tools like Tableau.

The expertise required is so broad that it's impossible for a single backend developer, frontend developer, data engineer, or database architect to handle it alone.

Most clients we meet at Da Vinci Technologies complain about difficulties in data analysis, querying, or modification. While they may refer to this as Admin, Backoffice, CMS, or ERP, the essence is the same: poor database management.

Conclusion

Are you finding it difficult to navigate your data? That's not necessarily a bad thing. Having abundant data and a reason to look at it usually indicates business growth.

No technological issue is insurmountable in business. With the right team, even the most challenging problems can eventually be solved.

And sometimes, a cost-free diagnosis alone can offer easy and quick solutions to improve your database management.

If you feel your database management isn't keeping pace with your business growth, feel free to contact us.

Until next time,

Dongeun

More articles

Do you know your exact developer needs?

Get it wrong, and you'll either overstaff or understaff your team.

Read more

All you must know about SMR

Small Modular Reactor - what is so important about it?

Read more

Tell us what you need

Our offices

  • Seoul
    13, Bongeunsa-ro 82-gil
    Gangnam-gu, Seoul, South Korea
  • Palo Alto
    3223 Hanover Street
    Palo Alto, California 94304, United States