5 min read

MDS Newsletter #89

MDS Newsletter #89

🤔 Are you passionate about delving into data? Are you an active member of the data community who always stays up-to-date with the latest trends? If so, we have the perfect edition for you! This latest edition is packed with exciting content, including a brand new podcast episode featuring the founders of Dozer, news about the funding of a new data startup, and exclusive insights into what's in store at the upcoming Snowflake Summit.

So why wait? Jump right in and start exploring...

Modern Data Show S02 E15

S02 E15 Data Source to API in Minutes with Matteo Pelati and Vivek Gudapuri, founders at Dozer: Prepare to be amazed in this episode as Matteo Pelati and Vivek Gudapuri, the brilliant minds behind Dozer, reveal their experience in pushing the boundaries of data management and analysis. By simplifying the process of data serving and allowing companies to create APIs quickly and efficiently, Dozer's approach sets them apart from the modern data stack. Their open-source approach allows developers to build custom operators and extend connectors, ensuring that Dozer can cover a wide range of use cases while still offering customization at each step. They also discuss the challenges they faced during the development of Dozer and how they are positioned to adapt to upcoming trends and developments in real-time data processing.  

You can listen to this episode on Spotify, YouTube, Google Podcast, Apple Podcast and Amazon Music

  • lakeFS: LakeFS provides an open-source data version control system for data lakes. Their platform enables git-like operations on data, allowing for time travel between consistent snapshots and promoting only high-quality data to production. With lakeFS, organizations can increase data quality, reduce errors, and improve data engineering practices.

    lakeFS has raised a total of $23M in funding over 1 round. This was a Series A round raised on Jul 28, 2021.
  • Cinchy: Cinchy is a data collaboration platform that transforms enterprise work by enabling business teams to execute their desired outcomes, freeing them from reliance on technical teams. It removes rigid data contracts and integration-heavy approaches to business unit collaboration, empowering organizations to achieve outcomes faster and more efficiently. Cinchy transforms data silos into meshed data products, where data is governed yet freely available, enabling collaborative intelligence throughout the business.

    Cinchy has raised a total of $24.2M in funding over 8 rounds. Their latest funding was raised on Oct 27, 2022, from a Series B round.
  • Veed: VEED.IO is an AI-powered online video editing platform that makes creating videos easy and accessible to everyone.

    Here are the data tools of Veed:

Good reads and resources

  • Are You a Data Ticket Taker or Decision Maker?: In the world of data, being reactive can be costly. Barr Moses, the author of this article makes a compelling case for why proactive data teams are more valuable in today's economic conditions. He outlines four practical ways data leaders can transition from being reactive to proactive decision-makers. These include prioritizing organizational goals and customer needs, setting metrics for driving business growth, identifying and addressing data quality issues, and viewing team efforts as investments. Barr concludes by emphasizing the importance of partnering with business teams and focusing on customer impact. Readers are invited to explore additional examples of proactive data teams.
  • How Data Lineage is Revolutionizing Data Integration, Migration, and Cloud Computing: Data lineage is becoming increasingly important in managing and utilizing large amounts of data. It is enabling organizations to map their data flow and gain a comprehensive understanding of their data sources, transformations, and usage. Niraj Kumar explains that data lineage is being used to support data integration and migration projects by providing a clear understanding of the data flows and dependencies between different systems and applications. It is also being used in cloud computing environments to ensure data consistency and integrity, establish governance policies and procedures, and identify potential security risks. As data continues to grow in complexity and volume, data lineage will remain an essential component of effective data management strategies.

Data startup funding news

Elementl raises $33M Series B for its data orchestration platform based on Dagster: Elementl empowers organizations to build a productive and scalable data platform. Elementl is building Dagster, an open-source orchestration platform for the development, production, and observation of data assets. Elementl's platform focuses on data assets, rather than tasks, and provides a ledger of every asset and its state transitions, with metadata. The company has also launched a tool that allows Airflow users to run pipelines written for it on Dagster.

Founder of Elementl: Nick Schrock

Upcoming data events, summits and webinars

  • Experience the world's largest data, apps, and AI conference - Snowflake Summit! Join data professionals, data scientists, and application developers at Caesars Forum in Las Vegas from June 26-29 to learn from experts about how data, generative AI, and LLM are reshaping the enterprise. Witness the latest innovations coming to the Data Cloud with advanced sessions on Generative AI and LLM.
    Register now for Snowflake Summit and take your organization to the next level!
  • The 2023 RocksDB Mid-Year In-Person Meetup is set to take place on June 13th at the Rockset office locations in San Mateo, California. Attendees will have the opportunity to meet other engineers in the RocksDB community and learn from industry experts. Food and swag will be provided, and a Zoom link will be available for those unable to attend in person. The event will feature talks on topics such as managing stateful streaming pipelines, isolating streaming ingest and queries, building message storage for billions of users, and more.
    Register for the event here!

MDS Jobs

  • EQT is hiring Analytics Engineer
    Location: Hiring roles in New York, Stockholm, and London
    Stack: BigQuery, Fivetran, DBT, Hex, Carto
    Apply Here
  • Skimlinks is hiring Data Engineer
    Location: UK based
    Stack: Python (Expert), Google Cloud, Airflow, DBT, Looker
    Apply Here
  • Alliants is hiring Data Engineer
    Location: United kingdom (Remote)
    Stack: Azure, AWS, Python, Power BI
    Apply Here
  • AirHelp is hiring Senior Data Engineer
    Location: Barcelone, Madrid (Hybrid/Remote)
    Stack: AWS, MySQL, Tableau, BigQuery
    Apply Here

Just for fun 😀

Are you always hungry for more information and updates about the ever-evolving world of data?

Well, you're in luck! By following us on LinkedIn and Twitter, you'll gain access to all the latest and greatest data content!

But wait, there's more! We want to hear from you - rate us here and let us know how we're doing.

Love it | It's great |  Good | Okay-ish | Meh

We welcome any suggestions, articles you would like us to showcase, or data engineering job listings that you may have. Don't hesitate to get in touch with us and we would be delighted to incorporate your input into our next edition.

About Moderndatastack.xyz‌‌ - We're building a platform to bring together people in the data community to learn everything about building and operating a Modern Data Stack. It's pretty cool - do check it out :)