Blog

Featured

100+ Books to Read

(updated on March 23, 2020)
2020:
Teng on January 2, 2020
I was searching for motivation/inspiration of my reading in 2020. The following quote fits my need perfectly.
“A reader lives a thousand lives before he dies, said Jojen. The man who never reads lives only one.” --- George R. R. Martin. A Dance with Dragons
  1. [x]格差社会, (日)橘木俊诏,2019 [Jan 2-3, 2020]
  2. [x]薛兆丰经济学讲义, 薛兆丰, 2018 [Jan 4-8, 2020]
  3. [x]不三, 冯唐 [Mar 16, 2020]
  4. [x]云雀叫了一整天,木心 [Mar 23, 2020]

Teng on July 7, 2019

I recently learned that other PhD students experienced mid-PhD crisis as well [link]. To cope with it, some recommend to start small projects to nurture a sense of accomplishment and to see things get finished. Those small projects could be baking a cake and taking a painting class [link]. I started something similar in the end of 2017 when I was in the first semester of my third year. Other than the joy from reading and learning, compiling a list relieves some level of anxiety too.

2019: [13 books]
  1. [x]a phd is not enough
  2. [x]Siddhartha
  3. [x]Teaching College
  4. [x]成事 - 冯唐
  5. [x]the non-designer’s design book, fourth edition
  6. [x]中国人在硅谷——netscreen的故事
  7. [x]不二 - 冯唐
  8. [x]三十六大 - 冯唐
  9. [x]the professor is in 
  10. [x]The book of why by Judea Pearl
  11. [x]101 Job Interview Questions You’ll Never Fear Again, James Reed
  12. [x]Letters to a Young Scientist, Edward O. Wilson
  13. [x]how to answer interview questions - Peggy McKee
2018: [20 books]
  1. [x]dare to lead
  2. [x]the subtle art of not giving a fuck
  3. [x]portraits of courage - George W Bush
  4. [x]Almost Interesting - David Spade
  5. [x]girl, wash your face
  6. [x]不再神圣的经济学
  7. [x]David Spade: a Polaroid guy in a snapchat world
  8. [x]book yourself solid
  9. [x]指数基金投资指南
  10. [x]how to win friends and influence people
  11. [x]番茄工作法图解
  12. [x]Nudge
  13. [x]the influential minds
  14. [x]the power of habit
  15. [x]a higher loyalty
  16. [x]the 5 second rule
  17. [x]Tig Notaro: I’m just a person
  18. [x]Lean in for Graduates: Sheryl Sandberg
  19. [x]thank you for being late
  20. [x]I can’t make this up - life lessons
2017: [3 books]
  1. [x]everybody lies
  2. [x]how to start a conversation and make friends
  3. [x]Breaking Free

Example Videos for Data Analytics

This post is for my students in OPIM3103-008 Spring 2020. Compare these examples to the two for Management Information Systems.

The Math Behind Basketball's Wildest Moves | Rajiv Maheswaran | TED Talks

How data transformed the NBA | The Economist

This is an easter egg. Email me with (1) the following quote; (2) your full name; and (3) the names of three types of Access database objects to me at teng.huang@uconn.edu. You will get two bonus points if your answer to (3) is correct and one bonus point if otherwise. This easter egg is active from 0:00 March 30, 2020 to 11:59pm April 5, 2020. (Email me within this time period.)

Here is the quote.

Don't be pushed by your problems. Be led by your dreams. -- Ralph Waldo Emerson

Database and Management Information Systems

This post is for my students in OPIM3103-008 Spring 2020.

A video for introduction to a brief history of databases.

The following two videos are to show you how information systems are implemented in various industries. I hope you could find some inspirations for your Project Obsession.

23andme DNA Processing Lab

Behind the scenes of an Amazon warehouse

This is an easter egg. Email me with (1) the following quote; (2) your full name; and (3) the answer to the multiple choice question in the end to me at teng.huang@uconn.edu. You will get two bonus points if your answer to this multiple choice question is correct; and one bonus point if otherwise. This easter egg is active from 0:00 April 6, 2020 to 11:59pm April 12, 2020. (Email me within this time period.)

Here is the quote.

This is the real secret to life—to be completely engaged with what you are doing
in the here and now. And instead of calling it work, realize it is play. -- Alan Watts

Here is the multiple choice question.

Which of the following are properties of primary keys of Access Tables? (Select all that apply)

a. The field that uniquely identifies each record in a table.

b. Access does not require that each table have a primary key.

c. A good database design usually includes a primary key in each table.

d. You should select unique and infrequently changing data for the primary key.

My Inspiration

 

(Mar 21, 2020)

George Pólya

If you can’t solve a problem, then there is an easier problem you can solve: find it. -- How To Solve It: A New Aspect of Mathematical Method, G. Polya, Stanford University


(Feb 8, 2020)

Joseph Campbell

We must be willing to let go of the life we’ve planned so as to have the life that is waiting for us.


(Feb 6, 2020)

William Blake, English poet, painter, and printmaker. Largely unrecognized during his lifetime, Blake is now considered a seminal figure in the history of the poetry and visual arts of the Romantic Age. [more on Wikipedia]

What is now proved was once  only imagined.

I must create a system, or be enslaved by another man's. I will not reason and compare: my business is to create.

You never know what is enough unless you know what is more than enough.

Exuberance is beauty.

To generalize is to be an idiot.

That Man should labour and sorrow, and learn and forget and return to the dark valley whence he came, and begin his labours anew. -- The works of William Blake, poetic, symbolic, and critical


(Jan 19, 2020)

Voltaire, French philosopher:

Uncertainty is an uncomfortable position. But certainty is an absurd one.

Chinese proverb (according to A. Ben-Tal and L. El-Ghaoui. Robust Optimization. Princeton University Press, 2009; I haven't found the Chinese translation.):

To be uncertain is to be uncomfortable, but to be certain is to be ridiculous.


(Jan 18, 2020)

Friedrich Nietzsche:

He who has a why to live can bear almost any how.

 

HOWTO: Work with Small Datasets

(updated on January 12, 2020; not complete yet)

 

In general:

  1. 7 Effective Ways to Deal with a Small Dataset [link]
  2. Dealing with very small datasets [link]
  3. What to do with "small" data? [link]

 

 

For Images:

  1. Breaking the curse of small datasets in Machine Learning: Part 1 [link]
  2. Breaking the curse of small data sets in Machine Learning: Part 2 [link]
  3. You can probably use deep learning even if your data isn't that big [link]
  4. Applying deep learning to real-world problems [link]

HOWTO: Data Augmentation

[last updated on January 12, 2020; not complete yet]

 

Data Augmentation:

  1. Research Guide: Data Augmentation for Deep Learning, [Nearly] Everything you need to know in 2019 [link], keywords: Random Erasing Data Augmentation (2017), AutoAugment: Learning Augmentation Strategies from Data (CVPR 2019), Fast AutoAugment (2019), Learning Data Augmentation Strategies for Object Detection (2019), SpecAugment: for Automatic Speech Recognition (Interspeech 2019), EDA: for Boosting Performance on Text Classification Tasks (EMNLP-IJCNLP 2019), Unsupervised Data Augmentation for Consistency Training (2019)
  2. Data augmentation on entire dataset before splitting [link], conclusion: this practice is incorrect.
  3. How does data augmentation reduce overfitting? [link]

 

Data Augmentation for Regression Tasks:

Online articles that mentioned DA for regression tasks:

  1. Shehroz Khan's answer to What does the term data augmentation mean in the context of machine learning? [link]
  2. What you need to know about data augmentation for machine learning [link]
  3. Data augmentation techniques for general datasets? [link] (Teng: To me, it seems they were discussing feature engineering instead of adding more data points.)
  4. Data Augmentation Techniques for Cat/Binary/Continuous Numerical Dataset [link], keywords: SMOTE

 

 

Data Augmentation for Unbalanced Dataset in Classification Tasks:

  1. Oversampling and undersampling in data analysis [link]
  2. imbalanced-learn [GitHub] [docs]
  3. A collection of 85 minority oversampling techniques (SMOTE) for imbalanced learning with multi-class oversampling and model selection features [docs] [GitHub]
  4. A Deep Dive Into Imbalanced Data: Over-Sampling [link]
  5. SMOTE for high-dimensional class-imbalanced data, Rok Blagus and Lara Lusa, 2013 [link]
  6. SMOTE explained for noobs - Synthetic Minority Over-sampling TEchnique line by line [link]
  7. Detecting representative data and generating synthetic samples to improve learning accuracy with imbalanced data sets [link]
  8. ADASYN: Adaptive Synthetic Sampling Method for Imbalanced Data [link]

 

Data Augmentation for Image:

  1. Data Augmentation for Deep Learning [link], keywords: image augmentation packages, PyTorch framework
  2. 1000x Faster Data Augmentation [link], keywords: learn augmentation policies, Population Based Augmentation, Tune Framework
  3. A survey on Image Data Augmentation for Deep Learning, Connor Shorten and Taghi M. Khoshgoftaar [link]
  4. Python | Data Augmentation [link]
  5. How to Configure Image Data Augmentation in Keras [link]
  6. Data Augmentation | How to use Deep Learning when you have Limited Data -- Part 2 [link], keywords: online augmentation, offline augmentation
  7. Data augmentation for improving deep learning in image classification problem, Mikolajczyk et al. [link]
  8. The Effectiveness of Data Augmentation in Image Classification using Deep Learning, Jason Wang and Luis Perez [link]

 

Data Augmentation for Audio:

to be added ...

 

Data Augmentation for Texts:

  1. These are the Easiest Data Augmentation Techniques in Natural Language Processing you can think of -- and they work. Simple text editing techniques can make huge performance gains for small datasets. [link]

 

Data Augmentation for Time Series

  1. Data Augmentation strategies for Time Series Forecasting [link]

 

 

Academic Job Listings for Operations Management and Information Systems

OM/OR

  1. Operations Academia (link)
  2. Decision Science Institute Job Postings (link)
  3. POMS job openings (link)
  4. INFORMS Career Center (link) and mailing lists
  5. DMANET (link)
  6. ORNET(link)
  7. HigherEdJobs (link)
  8. AcademicJobsOnline (link)
  9. MathJobs (American Mathematical Society) (link)

IS

  1. Association for Information Systems Career Services (link) and mailing lists (link)

* By courtesy of Miao Bai, David Bergman, and Yuan Jin.

** Please let me know if I missed anything.

Data Visualization

Videos:

  • A Brief History of Data Visualization | Stanford [link]
  • History of Data Visualization and Telling a Story with Data | UC Berkeley Events [link]
  • 34 Data Visualization: A Brief History of Maps, Time Series, and Charts (FR) | Berkeley Initiative for Transparency in the Social Sciences (BITSS) [link]

Online Articles:

  • A brief history of data visualization | Jon Hazell [link]
  • 8 fantastic examples of data storytelling | import.io [link]
  • Randy Olson here to answer all of your questions about data visualization [link]
  • Spurious Correlations [link]

Books:

  • Tukey | Exploratory Data Analysis [amazon]
  • Robin Williams | The Non-Designer's Design Book (4th Edition) [amazon]
  • Gene Zelazny | Say It with Charts: The Executive's Guide to Visual Communication [amazon]