Blog

Featured

100+ Books to Read

(updated on August 21, 2023)
HT on 2023-7-11:
“读书时不求记住书中的全部知识,只要有一两个观点促使自己发生了切实的改变就足够了……”——《认知觉醒》

2023:
  1. [x]春光好,黄佟佟(2023-1-24 - 2023-1-26)
  2. [x]通俗小说,仁科(2023-1-24 - 2023-1-31)
  3. [x]精通区块链编程:加密货币原理、方法和应用开发,[希]Andreas M. Antonopoulos
  4. [x]精通以太坊:开发智能合约和去中心化应用,[希]Andreas M. Antonopoulos
  5. [x]Bitcoin and Cryptocurrency Technologies: A Comprehensive Introduction,Arvind Narayanan et al
  6. [x]区块链:技术驱动金融,[美]Arvind Narayanan等著
  7. [x]区块链:从数字货币到信用社会,长铗,韩锋等
  8. [x]奇特的一生,2023年7月7日
  9. [x]认知觉醒,2023年7月12日
  10. [x]卡片笔记写作法,2023年7月16日
  11. [x]凝视深渊:悬案、侧写和我对破译犯罪心理的探索,2023年8月9日
  12. [x]自我突围,施一公,2023年8月9日
  13. [x]挣脱,刘任侠,2023年8月10日
  14. [x]春天,我想去田野里采一朵花,2023年8月20日
2022: [23 books]
  1. [x]解忧杂货店,东野圭吾(2022-1-1)
  2. [x]恶意,东野圭吾(2022-1-5)
  3. [x]可是我偏偏不喜欢,吴晓乐(2022-1-7)
  4. [x]只差最后一个谎言,东野圭吾(2022-1-8 start)2022-1-10
  5. [x]蛤蟆先生去看心理医生(2022-1-12)
  6. [x]一个陌生女人的来信(短篇小说),茨维格(2022-1-16)
  7. [x]世界尽头的咖啡馆(2022-1-16)
  8. [x]白夜行,东野圭吾(2022-1-21)
  9. [x]律政雄心:一位亚裔女孩的最高法院之路(戴维·拉特)(2022-1-23)
  10. [x]Python程序设计基础(第2版),董付国编著,清华大学出版社(2022-1-24)
  11. [x]Python语言程序设计基础(第2版),颂天等著,高等教育出版社(2022-1-24)
  12. [x]祈念守护人,东野圭吾(2022-1-27)
  13. [x]天才基本法(4本),长洱(2022-2-2)
  14. [x]向阳花海,林烁(2022-2-8)
  15. [x]夜晚的潜水艇,陈春成(2022-2-8开始,2022-2-12完成)
  16. [x]如果人类没有爱情就好了,邵艺辉(2022-2-13开始,2022-2-14完成)
  17. [x]狂人日记,鲁迅(短篇小说,2022-2-18)
  18. [x]学问的冒险,严飞(2022-2-16开始,2022-2-19)
  19. [x]伤心咖啡馆之歌(中篇小说,2022-2-20)
  20. [x]刻意练习
  21. [x]兄弟,余华(2022-10-21)
  22. [x]学者的术与道(2022-10-26)
  23. [x]圆圈正义(2022-11-6)

2021: [22 books]
  1. [x]Blockchain: A practical guide to developing business, law and technology solutions
  2. [x]Blockchain for dummies
  3. [x]Bitcoin and Cryptocurrency Technologies
  4. [x]白话区块链,蒋勇等
  5. [x]智能之门,胡晓武等
  6. [x]尘埃落定,阿来
  7. [x]长长的回廊,东野圭吾
  8. [x]善哉善哉,就你话多 2021-6-15
  9. [x]我承认我不曾经历沧桑,蒋方舟 2021-7-13
  10. [x]宇宙超度指南,李诞
  11. [x]笑场,李诞 2021-7-26
  12. [x]无人生还,阿加莎 2021-8-2
  13. [x]微信背后的产品观,张小龙编著 2021-8-2
  14. [x]冷场 2021-8-3
  15. [x]鸳鸯六七四 2021-8-10
  16. [x]李诞脱口秀工作手册 2021-9-12
  17. [x]起床后的黄金1小时,池田千惠 2021-10-6
  18. [x]窄门,纪德 2021-10-27
  19. [x]紫金陈高智商犯罪 2021-11-16
  20. [x]紫金陈低智商犯罪 2021-11-24
  21. [x]人工智能简史,尼克 2021-12-7
  22. [x]为什么精英都是时间控 2021-12-14

2020: [8 books]
Teng on January 2, 2020
I was searching for motivation/inspiration of my reading in 2020. The following quote fits my need perfectly.
“A reader lives a thousand lives before he dies, said Jojen. The man who never reads lives only one.” --- George R. R. Martin. A Dance with Dragons
  1. [x]格差社会, (日)橘木俊诏,2019 [Jan 2-3, 2020]
  2. [x]薛兆丰经济学讲义, 薛兆丰, 2018 [Jan 4-8, 2020]
  3. [x]不三, 冯唐 [Mar 16, 2020]
  4. [x]云雀叫了一整天,木心 [Mar 23, 2020]
  5. [x]方方日记,汪芳 [April 20, 2020]
  6. [x]房思琪的初恋乐园 [May 4, 2020]
  7. [x]围城 [Aug 24 - 29, 2020]
  8. [x]你来人间一趟,你要看看太阳,海子 [Mar 17, 2020 - Aug 29, 2020]

Teng on July 7, 2019

I recently learned that other PhD students experienced mid-PhD crisis as well [link]. To cope with it, some recommend to start small projects to nurture a sense of accomplishment and to see things get finished. Those small projects could be baking a cake and taking a painting class [link]. I started something similar in the end of 2017 when I was in the first semester of my third year. Other than the joy from reading and learning, compiling a list relieves some level of anxiety too.

2019: [13 books]
  1. [x]a phd is not enough
  2. [x]Siddhartha
  3. [x]Teaching College
  4. [x]成事 - 冯唐
  5. [x]the non-designer’s design book, fourth edition
  6. [x]中国人在硅谷——netscreen的故事
  7. [x]不二 - 冯唐
  8. [x]三十六大 - 冯唐
  9. [x]the professor is in 
  10. [x]The book of why by Judea Pearl
  11. [x]101 Job Interview Questions You’ll Never Fear Again, James Reed
  12. [x]Letters to a Young Scientist, Edward O. Wilson
  13. [x]how to answer interview questions - Peggy McKee
2018: [20 books]
  1. [x]dare to lead
  2. [x]the subtle art of not giving a fuck
  3. [x]portraits of courage - George W Bush
  4. [x]Almost Interesting - David Spade
  5. [x]girl, wash your face
  6. [x]不再神圣的经济学
  7. [x]David Spade: a Polaroid guy in a snapchat world
  8. [x]book yourself solid
  9. [x]指数基金投资指南
  10. [x]how to win friends and influence people
  11. [x]番茄工作法图解
  12. [x]Nudge
  13. [x]the influential minds
  14. [x]the power of habit
  15. [x]a higher loyalty
  16. [x]the 5 second rule
  17. [x]Tig Notaro: I’m just a person
  18. [x]Lean in for Graduates: Sheryl Sandberg
  19. [x]thank you for being late
  20. [x]I can’t make this up - life lessons
2017: [3 books]
  1. [x]everybody lies
  2. [x]how to start a conversation and make friends
  3. [x]Breaking Free

Can we derive formulas from neural networks or decision trees?

(updated on June 10, 2020)

I was wondering if we could derive numerical relationships between input variables and output variables from nonlinear model structures such neural networks and decision trees. I find a Q&A on ResearchGate, and I like the answer.

One answer wrote: "It is possible to obtain an equation after developing an ANN for prediction. in fact, you will end up with a long equation including the inputs, weights, biases, etc."

Multi-output Regression Models

(updated on June 10, 2020)

A multi-output regression task predicts multiple numerical properties for each sample (reference).


The article titled "Regression Models with multiple target variables" by Kiran Karkera (link) covers exactly what I am interested. Here are the key points other than the modeling details.

  1. Terminology: multi-output regression or multi-target regression; related terms for classification tasks are multi-label classification, multi-class classification, and multioutput-multiclass classification (aka multi-task classification).
  2. Popular open source ML libraries have little support for the multi-output regression task.

These are the two papers that are mentioned in Kiran Karkera's article.


Here is an article discussing how to develop multi-output regression models with python posted online on March 27, 2020.

 

2020 OR Talks

(update on April 20, 2020)

Analytics for a Better World Webinars (ABW-W) [more information]

  1. Wednesday April 29, 2020,  EST 11AM (CET 17.00 PM), Speaker: Dimitris Bertsimas (MIT, Cambridge) | on a variety of aspects of COVID-19
  2. Wednesday May 27, 2020, CET 17.00 PM, Speakers: Koen Peters (World Food Programme and Zero Hunger Lab, Tilburg University), Hein Fleuren (Zero Hunger Lab, Tilburg University) | They will highlight Tilburg University's (The Netherlands) work at the United Nations - World Food Programme (WFP) headquarters in Rome

Discrete Optimization Talks (DOTs) [homepage]

Collection of Talks on COVID-19’s Impacts

INFORMS Chicago Chapter Webinar Series

  1. Modeling Supply Chain Decisions for Resource Sharing with an Application to Ventilator Allocation to Combat COVID-19 | Dr. Sanjay Mehrotra, Northwestern University | June 18, 2020 [paper] [info]
  2. Risk-based Thinking in a COVID-19 World | Dr. Sheldon Jacobson, University of Illinois | June 26, 2020 [info]

Managing Uncertainty: Adapting to and learning from the COVID-19 crisis - A Rotman Insights Webinar Series [link]

  1. Pandemic Protectionism: The Global Trade Impact of COVID-19, May 8, 2020
  2. Distribution Disruption: How to Build a Better Supply Chain after COVID-19, May 15, 2020

Articles

  1. Yuval Noah Harari: the world after coronavirus, March 20, 2020 | [financial times]
  2. Life in the Time of COVID18 - Prof Adam Przeworski, April 5, 2020 [Google Doc]

VMACS - Virtual Macro Seminar

  1. VMACS | Pandemics According to HANK - Greg Kaplan, Ben Moll, Gianluca Violante, March 31, 2020  | [YouTube] [slides]
  2. VMACS | Macroeconomic Implications of COVID-19: Can Negative Supply Shocks Cause Demand Shortages? April 2, 2020 | [YouTube]
  3. VMACS | Pandemics Depress the Economy, Public Health Interventions Do not: Evidence from the 1918 Flu | April 7, 2020 | [YouTube]
  4. VMACS | The Macroeconomics of Epidemics - Martin Eichenbaum, Sergio REbelo, and Mathias Trabandt, April 9, 2020 | [YouTube]
  5. VMACS | The Behavioral SIR Model, with Application to the Swine Flue Epedemic | For Whom the Bell Tolls: Avoidance Behavior at Breakout in COVID19, April 10, 2020 | [YouTube]

INFORMS online seminar

  1. From Pandemic Disruption to Global Supply Chain Recovery - Dr. Simchi-Levi, April 9, 2020 | INFORMS | [video] [slides]
  2. Ripple Effect in Supply Chains at Different Pandemic Stages - Dr. Dmitry Ivanov, May 14, 2020 | INFORMS
  3. Analytics in Turbulent Times - Alan Jacobson, June 17, 2020 | INFORMS [YouTube]
  4. Blood, Sweat, & PPEs: Rescuing Perishable Product Supply Chains & Impacting Policy through Analytics - Dr. Anna Nagurney, June 15, 2020 | INFORMS [YouTube]

Royal Economic Society Webinar Series

  1. Economic approaches for analysing the short, medium term and long run impact of the COVID-19 crisis -  Prof Tim Besley (Chair), Prof Daron Acemoglu, Prof Jean Tirole, April 9, 2020 | Royal Economic Society [video]

Example Videos for Data Analytics

This post is for my students in OPIM3103-008 Spring 2020. Compare these examples to the two for Management Information Systems.

The Math Behind Basketball's Wildest Moves | Rajiv Maheswaran | TED Talks

How data transformed the NBA | The Economist

This is an easter egg. Email me with (1) the following quote; (2) your full name; and (3) the names of three types of Access database objects to me at teng.huang@uconn.edu. You will get two bonus points if your answer to (3) is correct and one bonus point if otherwise. This easter egg is active from 0:00 March 30, 2020 to 11:59pm April 5, 2020. (Email me within this time period.)

Here is the quote.

Don't be pushed by your problems. Be led by your dreams. -- Ralph Waldo Emerson

Database and Management Information Systems

This post is for my students in OPIM3103-008 Spring 2020.

A video for introduction to a brief history of databases.

The following two videos are to show you how information systems are implemented in various industries. I hope you could find some inspirations for your Project Obsession.

23andme DNA Processing Lab

Behind the scenes of an Amazon warehouse

This is an easter egg. Email me with (1) the following quote; (2) your full name; and (3) the answer to the multiple choice question in the end to me at teng.huang@uconn.edu. You will get two bonus points if your answer to this multiple choice question is correct; and one bonus point if otherwise. This easter egg is active from 0:00 April 6, 2020 to 11:59pm April 12, 2020. (Email me within this time period.)

Here is the quote.

This is the real secret to life—to be completely engaged with what you are doing
in the here and now. And instead of calling it work, realize it is play. -- Alan Watts

Here is the multiple choice question.

Which of the following are properties of primary keys of Access Tables? (Select all that apply)

a. The field that uniquely identifies each record in a table.

b. Access does not require that each table have a primary key.

c. A good database design usually includes a primary key in each table.

d. You should select unique and infrequently changing data for the primary key.

My Inspiration

 

(September 1, 2020)

刘慈欣2018卡拉克奖获奖感言

在未来,当人工智能拥有超过人类的智力时,想象力也许是我们对于它们所拥有的唯一优势。

(Teng: I do not think so.)

(July 25, 2020)

J. D. Salinger, The Heart of a Broken Story

I think love is a touch and yet not a touch.

(May 25, 2020)

Romain Rolland

There is only one heroism in the world: to see the world as it is and to love it.

(Mar 21, 2020)

George Pólya

If you can’t solve a problem, then there is an easier problem you can solve: find it. -- How To Solve It: A New Aspect of Mathematical Method, G. Polya, Stanford University


(Feb 8, 2020)

Joseph Campbell

We must be willing to let go of the life we’ve planned so as to have the life that is waiting for us.


(Feb 6, 2020)

William Blake, English poet, painter, and printmaker. Largely unrecognized during his lifetime, Blake is now considered a seminal figure in the history of the poetry and visual arts of the Romantic Age. [more on Wikipedia]

What is now proved was once  only imagined.

I must create a system, or be enslaved by another man's. I will not reason and compare: my business is to create.

You never know what is enough unless you know what is more than enough.

Exuberance is beauty.

To generalize is to be an idiot.

That Man should labour and sorrow, and learn and forget and return to the dark valley whence he came, and begin his labours anew. -- The works of William Blake, poetic, symbolic, and critical


(Jan 19, 2020)

Voltaire, French philosopher:

Uncertainty is an uncomfortable position. But certainty is an absurd one.

Chinese proverb (according to A. Ben-Tal and L. El-Ghaoui. Robust Optimization. Princeton University Press, 2009; I haven't found the Chinese translation.):

To be uncertain is to be uncomfortable, but to be certain is to be ridiculous.


(Jan 18, 2020)

Friedrich Nietzsche:

He who has a why to live can bear almost any how.

 

HOWTO: Work with Small Datasets

(updated on January 12, 2020; not complete yet)

 

In general:

  1. 7 Effective Ways to Deal with a Small Dataset [link]
  2. Dealing with very small datasets [link]
  3. What to do with "small" data? [link]

 

 

For Images:

  1. Breaking the curse of small datasets in Machine Learning: Part 1 [link]
  2. Breaking the curse of small data sets in Machine Learning: Part 2 [link]
  3. You can probably use deep learning even if your data isn't that big [link]
  4. Applying deep learning to real-world problems [link]

HOWTO: Data Augmentation

[last updated on January 12, 2020; not complete yet]

 

Data Augmentation:

  1. Research Guide: Data Augmentation for Deep Learning, [Nearly] Everything you need to know in 2019 [link], keywords: Random Erasing Data Augmentation (2017), AutoAugment: Learning Augmentation Strategies from Data (CVPR 2019), Fast AutoAugment (2019), Learning Data Augmentation Strategies for Object Detection (2019), SpecAugment: for Automatic Speech Recognition (Interspeech 2019), EDA: for Boosting Performance on Text Classification Tasks (EMNLP-IJCNLP 2019), Unsupervised Data Augmentation for Consistency Training (2019)
  2. Data augmentation on entire dataset before splitting [link], conclusion: this practice is incorrect.
  3. How does data augmentation reduce overfitting? [link]

 

Data Augmentation for Regression Tasks:

Online articles that mentioned DA for regression tasks:

  1. Shehroz Khan's answer to What does the term data augmentation mean in the context of machine learning? [link]
  2. What you need to know about data augmentation for machine learning [link]
  3. Data augmentation techniques for general datasets? [link] (Teng: To me, it seems they were discussing feature engineering instead of adding more data points.)
  4. Data Augmentation Techniques for Cat/Binary/Continuous Numerical Dataset [link], keywords: SMOTE

 

 

Data Augmentation for Unbalanced Dataset in Classification Tasks:

  1. Oversampling and undersampling in data analysis [link]
  2. imbalanced-learn [GitHub] [docs]
  3. A collection of 85 minority oversampling techniques (SMOTE) for imbalanced learning with multi-class oversampling and model selection features [docs] [GitHub]
  4. A Deep Dive Into Imbalanced Data: Over-Sampling [link]
  5. SMOTE for high-dimensional class-imbalanced data, Rok Blagus and Lara Lusa, 2013 [link]
  6. SMOTE explained for noobs - Synthetic Minority Over-sampling TEchnique line by line [link]
  7. Detecting representative data and generating synthetic samples to improve learning accuracy with imbalanced data sets [link]
  8. ADASYN: Adaptive Synthetic Sampling Method for Imbalanced Data [link]

 

Data Augmentation for Image:

  1. Data Augmentation for Deep Learning [link], keywords: image augmentation packages, PyTorch framework
  2. 1000x Faster Data Augmentation [link], keywords: learn augmentation policies, Population Based Augmentation, Tune Framework
  3. A survey on Image Data Augmentation for Deep Learning, Connor Shorten and Taghi M. Khoshgoftaar [link]
  4. Python | Data Augmentation [link]
  5. How to Configure Image Data Augmentation in Keras [link]
  6. Data Augmentation | How to use Deep Learning when you have Limited Data -- Part 2 [link], keywords: online augmentation, offline augmentation
  7. Data augmentation for improving deep learning in image classification problem, Mikolajczyk et al. [link]
  8. The Effectiveness of Data Augmentation in Image Classification using Deep Learning, Jason Wang and Luis Perez [link]

 

Data Augmentation for Audio:

to be added ...

 

Data Augmentation for Texts:

  1. These are the Easiest Data Augmentation Techniques in Natural Language Processing you can think of -- and they work. Simple text editing techniques can make huge performance gains for small datasets. [link]

 

Data Augmentation for Time Series

  1. Data Augmentation strategies for Time Series Forecasting [link]