ds_journal
About
Journal of Data Science Journey
Categories
All
(5)
My Understanding of Pydantic
I am recently reading the book
Robust Python
by Patrick Viafore. In this blog post, I want to discuss the usage of pydantic library, introduced in chapter 14 of the book.
Jan 26, 2022
Construct ROC Curve from Scratch
This post aims to show how to construct the receiver operating characteristic (roc) curve without using predefined functions. It hopes to help you better understand how the roc curve is constructed and how to interpret it.
Nov 18, 2021
Martin Pan
Notes from Data Science at the Command Line
In this post, I want to write down some ideas from Jeroen Janssens’s book
Data Science at the Command Line, 2e
. Instead of being a comprehensive overview of the book, this note only picks some ideas that add to my existing knowledge and hopefully it can bring some new thoughts to you. Of course, this blog post reflects my personal understanding and all errors remain mine.
Aug 19, 2021
Entity Embedding in fastai 2
This post aims to introduce how to use fastai v2 to implement entity embedding for categorical variables in tabular data. Entity embedding is a powerful technique that can sometimes boost the performance of various machine learning methods and reveal the intrinsic properties of categorical variables. See this paper and this post for more details.
Dec 21, 2020
Martin Pan
Logistic Regression’s Coefficients Explanation
This post is going to introduce logistic regression with an emphasis on explaining the coefficients in the output of Python’s statsmodels.
Nov 20, 2020
Martin Pan
No matching items