Use Hypothesis, whether you like writing tests or not
Cheuk Ting Ho
02:15-02:45 @ NYCU
I bet you like writing tests. But instead of the example-based tests that we normally write, have you heard of property-based testing? With Hypothesis, instead of deciding what data to test against, you let the library generate test data for you, including NumPy and pandas objects.
Build a useful data science product in your organization
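As a rough illustration of the difference from example-based tests, here is a minimal sketch of a property-based test. It assumes the `hypothesis` package is installed; the function under test (Python's built-in `sorted`) and the two properties are chosen purely for illustration and are not taken from the talk.

```python
from collections import Counter

from hypothesis import given, strategies as st


# Hypothesis generates many lists of integers (including empty lists and
# lists with duplicates) and calls the test once for each of them.
@given(st.lists(st.integers()))
def test_sorted_is_ordered_and_keeps_elements(values):
    result = sorted(values)
    # Property 1: every element is <= the element that follows it.
    assert all(a <= b for a, b in zip(result, result[1:]))
    # Property 2: sorting only reorders; no element is added or lost.
    assert Counter(result) == Counter(values)


if __name__ == "__main__":
    # Calling the decorated function runs the property against the generated inputs.
    test_sorted_is_ordered_and_keeps_elements()
```

The test states what must hold for any input rather than listing specific inputs, and when a property fails Hypothesis reports a shrunken counterexample.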
zonghan
02:55-03:25 @ NYCU
Data science products within private organizations usually start with high expectations and end up with low usage. This talk covers the common pitfalls of these data science products and shares some perspectives on building useful data science projects.
Data Contracts: Empowering Data Quality Enforcement
Shuhsi Lin
05:40-06:10 @ NYCU
In the realm of modern data engineering, ensuring data quality and fostering effective collaboration across teams are paramount. The introduction of dbt model contracts marks a pivotal advancement in this domain. These contracts provide a structured framework for defining and enforcing expectations about the output of data models. By examining the significance of dbt model contracts, this talk delves into their role in elevating data reliability, streamlining debugging processes, and optimizing resource utilization. We will explore the compelling advantages of using model contracts, from fostering a collaborative data culture to enhancing change management. However, the journey isn't without its challenges. Limited platform support, potential complexity, and the need for effective communication are among the hurdles to overcome. By comprehending the transformative potential and navigating potential pitfalls, this talk aims to empower data practitioners with insights to leverage dbt model contracts effectively, ultimately elevating data quality, team efficiency, and decision-making across the organization.
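To make the idea of "defining and enforcing expectations about the output" concrete, here is a minimal standard-library sketch of a contract check. It is not dbt's implementation (dbt model contracts are declared in a project's YAML configuration); the `Column` class, the `contract_violations` function, and the sample `orders` data are all hypothetical names invented for this illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Column:
    """One expected output column (hypothetical helper, not a dbt type)."""
    name: str
    dtype: type
    nullable: bool = False


def contract_violations(rows: list[dict[str, object]],
                        contract: list[Column]) -> list[str]:
    """Return a human-readable message for every row that breaks the contract."""
    expected = {col.name for col in contract}
    violations = []
    for i, row in enumerate(rows):
        # The set of columns must match the contract exactly.
        if set(row) != expected:
            violations.append(
                f"row {i}: columns {sorted(row)} != {sorted(expected)}")
            continue
        for col in contract:
            value = row[col.name]
            if value is None:
                if not col.nullable:
                    violations.append(f"row {i}: {col.name} must not be null")
            elif not isinstance(value, col.dtype):
                violations.append(
                    f"row {i}: {col.name} expected {col.dtype.__name__}, "
                    f"got {type(value).__name__}")
    return violations


# Hypothetical contract for an "orders" model output.
orders_contract = [
    Column("order_id", int),
    Column("customer", str),
    Column("note", str, nullable=True),
]

good_rows = [{"order_id": 1, "customer": "alice", "note": None}]
bad_rows = [
    {"order_id": "2", "customer": "bob", "note": "rush"},  # wrong type
    {"order_id": 3, "customer": None, "note": None},       # null not allowed
    {"order_id": 4, "customer": "carol"},                  # missing column
]
```

Here `contract_violations(good_rows, orders_contract)` returns an empty list, while each row in `bad_rows` produces one violation message, so a pipeline could refuse to publish output that breaks the agreed shape.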
In today's data-driven world, we constantly face the problems of large-volume data storage, analytics, and machine-learning applications. In the past, we used databases, data lakes, or data warehouses to store different kinds of data, including structured, unstructured, and semi-structured data. Although many storage systems and tools now exist to address these problems and scenarios, they still have limitations and imperfections.
To address these, one concept has gradually gained attention in recent years: the lakehouse, which combines the advantages of the data lake and the data warehouse into a powerful architecture for implementing the modern data stack. Several mature services and tools implement this concept, including Databricks Delta Lake, Apache Iceberg, and Apache Hudi.
In this session, I will briefly describe and analyze the concepts, benefits, and drawbacks of databases, data lakes, data warehouses, and lakehouses, and introduce some representative services. Finally, I will show a lakehouse demo so that attendees can understand it more concretely.