MDS Newsletter #93

Holy data overload! Can you believe it? Two major data conferences in one week! It's like the data universe exploded and rained down on us. Did you get a chance to attend the Snowflake Summit or the Databricks Data + AI Summit? If so, spill the beans! We want to hear all about it. For more data goodness, keep scrolling and dive into the latest edition below. Let's keep the data party going!

  • Artie: Artie is an open-source platform that provides real-time data transfer services from databases to data warehouses. The platform is designed to leverage Change Data Capture (CDC) and stream processing technologies to facilitate more efficient data transfers, resulting in sub-minute latency and cost savings for data warehousing. Artie Cloud offers an easy connector setup, with no coding required, and democratizes access to data streaming, enabling businesses to gain real-time insights and make better decisions.
  • Rockset: Rockset is a cloud-based real-time analytics platform. It offers sub-second SQL analytics, allowing users to search, aggregate, and join data at any scale. Rockset's converged indexing, built-in connectors, and cloud-native architecture are designed to make scaling simple and efficient.
    Rockset has raised a total of $61.5M over three funding rounds. The most recent was a Series B raised on Oct 27, 2020.
  • You Need A Budget: This app is designed to help individuals regain control over their finances by providing a method for managing their money. The company's aim is to change the relationship that people have with money, making money management less complicated. The app offers a friendly and flexible approach that enables users to enjoy guilt-free spending and effortless saving, helping them grow their savings and spend their money in a way that suits them.

Good reads and resources

  • Snowflake and Databricks Summits 2023: Feature Announcement Recaps and Comparisons: Michael Segner summarizes the Snowflake Summit and the Databricks Data + AI Summit, two conferences that have been a hot topic in the data industry. He highlights the major themes, including GenAI integration, data app builders and marketplaces, and data sharing, and notes pre-conference announcements such as ThoughtSpot's acquisition of Mode, Databricks' acquisition of MosaicML, and Snowflake's partnership with Microsoft. He then covers the keynote announcements: expanded Iceberg table support, Snowflake Native Apps, and Snowpark Container Services on the Snowflake side, and Lakehouse Apps, Delta Sharing, and various AI features on the Databricks side. Microsoft CEO Satya Nadella also joined the Databricks CEO to discuss responsible AI, and updates on Delta Lake 3.0 and Unity Catalog were shared. Overall, the recap offers a useful view of the latest developments and future direction of the data and AI industry.
  • Data Modeling Techniques For Data Warehouse: Data modeling is a complex process that requires careful consideration of various methodologies and techniques. Here, Mariusz Kujawski delves into the world of data modeling, with a particular focus on dimensional modeling and Kimball's methodology. He provides a comprehensive overview of different data modeling methodologies, including Inmon's methodology, Data Vault, and storing data in one wide table. Mariusz highlights the benefits of dimensional modeling, such as simplicity, improved query performance, flexibility, data consistency, and enhanced user adoption. Additionally, he covers the key principles and components of Kimball's methodology, including entity model to dimensional model transformation, fact and dimension tables, dimension types, slowly changing dimensions, and star schema versus snowflake schema. Finally, Mariusz discusses data loading strategies, such as full load, incremental load, and delta load.

Upcoming data events, summits and webinars

  • Discover the future of data, analytics, and AI at the global DataConnect Conference. With over 50 expert speakers, this event promises to deliver the latest trends, technologies, and innovations in data-related fields. This year's theme, "Data as a Product: An Empathetic Approach to Delivering Value," highlights the importance of delivering value to customers through data. Attendees will learn from industry leaders, collaborate with peers, and connect with the global data community.
    Don't miss out on this opportunity to learn, network, and engage with experts in Columbus, OH on July 20-21, 2023. Get your ticket here!

MDS Jobs

  • Ramp is hiring Senior Analytics Engineer
    Location: New York, Miami, Remote
    Stack: Fivetran, Snowflake, dbt, Looker
    Apply here
  • Vetsource is hiring Manager, Analytics
    Location: Canada & US
    Stack: Fivetran, Snowflake, dbt, Looker, Power BI, Hightouch
    Apply here
  • The California Office of Data and Innovation is hiring Lead Platform Engineer, Data
    Location: Sacramento, CA
    Stack: Snowflake, dbt, Fivetran
    Apply here

Just for fun 😀

Are you always hungry for more information and updates about the ever-evolving world of data?

Well, you're in luck! By following us on LinkedIn and Twitter, you'll gain access to all the latest and greatest data content!

But wait, there's more! We want to hear from you - rate us here and let us know how we're doing.

Love it | It's great | Good | Okay-ish | Meh

We welcome suggestions, articles you would like us to showcase, and data engineering job listings. Get in touch, and we would be delighted to include your input in our next edition.

About Moderndatastack.xyz - We're building a platform to bring together people in the data community to learn everything about building and operating a Modern Data Stack. It's pretty cool - do check it out :)