Spark in me - Internet, data science, math, deep learning, philo

snakers4 @ telegram, 1326 members, 1561 posts since 2016

All this - lost like tears in rain.

Data science, deep learning, sometimes a bit of philosophy and math. No bs.

Our website
- spark-in.me
Our chat
- goo.gl/WRm93d
DS courses review
- goo.gl/5VGU5A
- goo.gl/YzVUKf

Posts by tag «digest»:

snakers4 (Alexander), August 12, 11:15

2018 DS/ML digest 20

spark-in.me/post/2018_ds_ml_digest_20

#deep_learning

#digest

#data_science

2018 DS/ML digest 20

2018 DS/ML digest 20 Author's articles - http://spark-in.me/author/snakers41 Blog - http://spark-in.me


snakers4 (Alexander), July 31, 05:47

2018 DS/ML digest 19

Market / data / libraries

(0) 32k lesions image dataset open-sourced

- goo.gl/CUQwnv

- nihcc.app.box.com/v/DeepLesion

(1) A new Distill article about Differentiable Image Parameterizations

- Usually images are parametrized as RGB values (normalized)

- Idea - use different (learnable) parametrization

- distill.pub/2018/differentiable-parameterizations/

- Parametrizing the resulting image with a Fourier transform makes it possible to use different architectures for style transfer distill.pub/2018/differentiable-parameterizations/#figure-style-transfer-diagram

- Working with transparent images

(2) Lip reading with 40% Word Error Rate arxiv.org/pdf/1807.05162.pdf

(3) Joint auto architecture + hyper-param search arxiv.org/pdf/1807.06906.pdf (*)

(4) rl-navigation.github.io/deployable/

(5) New CNN architectures from ICML www.facebook.com/icml.imls/videos/429607650887089/%20 (*)

(6) Jupyter notebook widget for text annotation github.com/natasha/ipyannotate

(7) A bit more debunking of auto-ml by fast.ai www.fast.ai/2018/07/23/auto-ml-3/

(8) A small intro to Bayes methods alexanderdyakonov.wordpress.com/2018/07/30/%d0%b1%d0%b0%d0%b9%d0%b5%d1%81%d0%be%d0%b2%d1%81%d0%ba%d0%b8%d0%b9-%d0%bf%d0%be%d0%b4%d1%85%d0%be%d0%b4/

(9) Criminal face recognition 20% false positives - www.nytimes.com/2018/07/26/technology/amazon-aclu-facial-recognition-congress.html?

(10) Denoising images w/o noiseless ground-truth news.developer.nvidia.com/ai-can-now-fix-your-grainy-photos-by-only-looking-at-grainy-photos/?ncid=--45511
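
A rough sketch of the idea from item (1) in this list: instead of optimizing RGB pixels directly, keep Fourier coefficients as the learnable parameters and decode them into an image. The function name, the shapes and the sigmoid squashing are my own assumptions for illustration, this is not the Distill authors' code.

```python
import numpy as np

def decode_fourier_image(spectrum):
    """Turn learnable Fourier coefficients into an image with values in (0, 1).

    spectrum: complex array of shape (channels, H, W // 2 + 1),
    i.e. the layout produced by np.fft.rfft2.
    """
    h = spectrum.shape[1]
    w = (spectrum.shape[2] - 1) * 2
    pixels = np.fft.irfft2(spectrum, s=(h, w))   # back to pixel space
    return 1.0 / (1.0 + np.exp(-pixels))         # squash to a valid pixel range

rng = np.random.default_rng(0)
shape = (3, 32, 17)                              # 3 channels, 32x32 image
spectrum = rng.normal(size=shape) + 1j * rng.normal(size=shape)
image = decode_fourier_image(spectrum)
```

In a real setup the spectrum would be a tensor in an autograd framework and the style transfer loss would be back-propagated through the decoding step.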

NLP

(0) Autoencoders for text habr.com/company/antiplagiat/blog/418173/ - no clear conclusion?

(1) RNN use cases overview indico.cern.ch/event/722319/contributions/3001310/attachments/1661268/2661638/IML-Sequence.pdf

(2) ACL 2018 notes ruder.io/acl-2018-highlights/

Hardware

(0) Edge embeddable TPU devices aiyprojects.withgoogle.com/edge-tpu ?

(1) GeForce 11* finally coming soon? Prices for 1080Ti are falling now...

#digest

#deep_learning

NIH Clinical Center releases dataset of 32,000 CT images

Lesion data may make it easier for scientific community to identify tumor growth or new disease


snakers4 (Alexander), July 23, 05:15

2018 DS/ML digest 18

Highlights of the week

(0) RL flaws

thegradient.pub/why-rl-is-flawed/

thegradient.pub/how-to-fix-rl/

(1) An intro to AUTO-ML

www.fast.ai/2018/07/16/auto-ml2/

(2) Overview of advances in ML in last 12 months

www.stateof.ai/

Market / applied stuff / papers

(0) New Nvidia Jetson released

www.phoronix.com/scan.php?page=news_item&px=NVIDIA-Jetson-Xavier-Dev-Kit

(1) Medical CV project in Russia - 90% is data gathering

cv-blog.ru/?p=217

(2) Differentiable architecture search

arxiv.org/pdf/1806.09055.pdf

-- 1800 GPU days of reinforcement learning (RL) (Zoph et al., 2017)

-- 3150 GPU days of evolution (Real et al., 2018)

-- 4 GPU days to achieve SOTA in CIFAR => transferrable to Imagenet with 26.9% top-1 error

(3) Some basic thoughts about hyper-param tuning

engineering.taboola.com/hitchhikers-guide-hyperparameter-tuning/

(4) FB extending fact checking to mark similar articles

www.poynter.org/news/rome-facebook-announces-new-strategies-combat-misinformation

(5) Architecture behind Alexa choosing skills goo.gl/dWmXZf

- Char-level RNN + Word-level RNN

- Shared encoder, but attention is personalized

(6) An overview of contemporary NLP techniques

medium.com/@ageitgey/natural-language-processing-is-fun-9a0bff37854e

(7) RNNs in particle physics?

indico.cern.ch/event/722319/contributions/3001310/attachments/1661268/2661638/IML-Sequence.pdf?utm_campaign=Revue%20newsletter&utm_medium=Newsletter&utm_source=NLP%20News

(8) Google cloud provides PyTorch images

twitter.com/i/web/status/1016515749517582338
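
A toy sketch of the trick behind item (2) in this list (differentiable architecture search), as I understand the paper: the discrete choice of an operation is replaced by a softmax-weighted mixture of all candidates, so the architecture parameters can be trained by gradient descent. The candidate operations below are made-up stand-ins, not the search space from the paper.

```python
import numpy as np

def softmax(z):
    z = z - np.max(z)
    e = np.exp(z)
    return e / e.sum()

# Toy candidate operations on a 1D feature vector (assumed for illustration)
CANDIDATE_OPS = {
    "identity": lambda x: x,
    "zero": lambda x: np.zeros_like(x),
    "double": lambda x: 2.0 * x,
}

def mixed_op(x, alpha):
    """Continuous relaxation: softmax-weighted sum of all candidate ops."""
    weights = softmax(alpha)
    return sum(w * op(x) for w, op in zip(weights, CANDIDATE_OPS.values()))

def discretize(alpha):
    """After the search, keep only the op with the largest weight."""
    return list(CANDIDATE_OPS)[int(np.argmax(alpha))]

x = np.array([1.0, -2.0, 3.0])
alpha = np.array([0.1, -1.0, 2.0])   # learnable architecture parameters
y = mixed_op(x, alpha)
best = discretize(alpha)
```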

NLP

(0) Use embeddings for positions - no brainer

twitter.com/i/web/status/1018789622103633921

(1) Chatbots were a hype train - lol

medium.com/swlh/chatbots-were-the-next-big-thing-what-happened-5fc49dd6fa61

The vast majority of bots are built using decision-tree logic, where the bot’s canned response relies on spotting specific keywords in the user input.

Interesting links

(0) Reasons to use OpenStreetMap

www.openstreetmap.org/user/jbelien/diary/44356

(1) Google deploys its internet balloons

goo.gl/d5cv6U

(2) Amazing problem solving

nevalalee.wordpress.com/2015/11/27/the-hotel-bathroom-puzzle/

(3) Nice flame thread about CS / ML is not science / just engineering etc

twitter.com/RandomlyWalking/status/1017899452378550273

#deep_learning

#data_science

#digest

RL’s foundational flaw

RL as classically formulated has lately accomplished many things - but that formulation is unlikely to tackle problems beyond games. Read on to see why!


snakers4 (spark_comment_bot), July 13, 05:22

2018 DS/ML digest 17

Highlights of the week

(0) Troubling trends with ML scholars

approximatelycorrect.com/2018/07/10/troubling-trends-in-machine-learning-scholarship/

(1) NLP close to its ImageNet stage?

thegradient.pub/nlp-imagenet/

Papers / posts / articles

(0) Working with multi-modal data distill.pub/2018/feature-wise-transformations/

- concatenation-based conditioning

- conditional biasing or scaling ("residual" connections)

- sigmoidal gating

- all in all this approach seems like a mixture of attention / gating for multi-modal problems

(1) Glow, a reversible generative model which uses invertible 1x1 convolutions

blog.openai.com/glow/

(2) Facebook's moonshots - I kind of do not understand much here

- research.fb.com/facebook-research-at-icml-2018/

(3) RL concept flaws?

- thegradient.pub/why-rl-is-flawed/

(4) Intriguing failures of convolutions

eng.uber.com/coordconv/ - this is fucking amazing

(5) People are only STARTING to apply ML to reasoning

deepmind.com/blog/measuring-abstract-reasoning/
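
The three conditioning flavours from item (0) in this list (working with multi-modal data) fit in a few lines. This is a minimal numpy sketch: the weight matrices stand in for small learned networks, and all names are assumptions of mine.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def concat_conditioning(features, condition):
    """Concatenation-based conditioning: append the condition vector."""
    return np.concatenate([features, condition], axis=-1)

def scale_and_bias(features, condition, w_gamma, w_beta):
    """Conditional scaling and biasing: gamma(c) * x + beta(c)."""
    gamma = condition @ w_gamma
    beta = condition @ w_beta
    return gamma * features + beta

def sigmoidal_gating(features, condition, w_gate):
    """Sigmoidal gating: scale each feature by a value in (0, 1)."""
    return sigmoid(condition @ w_gate) * features

rng = np.random.default_rng(0)
features = rng.normal(size=(2, 4))    # e.g. image features, batch of 2
condition = rng.normal(size=(2, 3))   # e.g. text features for the same batch
w = rng.normal(size=(3, 4))           # stand-in for a learned projection
gated = sigmoidal_gating(features, condition, w)
```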

Yet another online book on Deep Learning

(1) Kind of standard livebook.manning.com/#!/book/grokking-deep-learning/chapter-1/v-10/1

Libraries / code

(0) Data version control continues to develop dvc.org/features

#deep_learning

#data_science

#digest

Like this post or have something to say => tell us more in the comments or donate!

Troubling Trends in Machine Learning Scholarship

By Zachary C. Lipton* & Jacob Steinhardt* *equal authorship Originally presented at ICML 2018: Machine


snakers4 (Alexander), July 04, 07:57

2018 DS/ML digest 15

What I filtered through this time

Market / news

(0) Letters by big company employees against using ML for weapons

- Microsoft

- Amazon

(1) Facebook open sources Dense Pose (essentially this is Mask-RCNN)

- research.fb.com/facebook-open-sources-densepose/

Papers / posts / NLP

(0) One more blog post about text / sentence embeddings goo.gl/Zm8C2c

- key idea different weighting

(1) One more sentence embedding calculation method

- openreview.net/pdf?id=SyK00v5xx ?

(2) Posts explaining NLP embeddings

- www.offconvex.org/2015/12/12/word-embeddings-1/ - some basics - SVD / Word2Vec / GloVe

-- SVD improves embedding quality (as compared to ohe)?

-- use log-weighting, use TF-IDF weighting (the above weighting)

- www.offconvex.org/2016/02/14/word-embeddings-2/ - word embedding properties

-- dimensions vs. embedding quality www.cs.princeton.edu/~arora/pubs/LSAgraph.jpg

(3) Spacy + Cython = 100x speed boost - goo.gl/9TwVqu - good to know about this as a last resort

- described use-case

you are pre-processing a large training set for a DeepLearning framework like pyTorch/TensorFlow

or you have a heavy processing logic in your DeepLearning batch loader that slows down your training

(4) Once again stumbled upon this - blog.openai.com/language-unsupervised/

(5) Papers

- Simple NLP embedding baseline goo.gl/nGujzS

- NLP decathlon for question answering goo.gl/6HHi7q

- Debiasing embeddings arxiv.org/abs/1806.06301

- Once again transfer learning in NLP by open-AI - goo.gl/82VR4U
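
To make the SVD + log-weighting point from item (2) concrete, here is a minimal sketch on a toy corpus. The corpus, the window size and the embedding dimension are arbitrary assumptions, and this is not the exact recipe from the offconvex posts.

```python
import numpy as np

corpus = [
    "the cat sat on the mat",
    "the dog sat on the rug",
    "a cat and a dog",
]
tokens = [s.split() for s in corpus]
vocab = sorted({w for s in tokens for w in s})
index = {w: i for i, w in enumerate(vocab)}

# Symmetric co-occurrence counts within a +/-2 word window
window = 2
counts = np.zeros((len(vocab), len(vocab)))
for sent in tokens:
    for i, w in enumerate(sent):
        for j in range(max(0, i - window), min(len(sent), i + window + 1)):
            if i != j:
                counts[index[w], index[sent[j]]] += 1

weighted = np.log1p(counts)          # log-weighting dampens very frequent pairs
u, s, _ = np.linalg.svd(weighted)
dim = 3
embeddings = u[:, :dim] * s[:dim]    # one dense row per word
```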

#deep_learning

#digest

#data_science

snakers4 (Alexander), July 02, 04:51

2018 DS/ML digest 14

Amazing article - why you do not need ML

- cyberomin.github.io/startup/2018/07/01/sql-ml-ai.html

- I personally love plain-vanilla SQL and in 90% of cases people under-use it

- I even wrote 90% of my JSON API on our blog in pure PostgreSQL xD
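
A tiny illustration of the "you need SQL" point. It uses the sqlite3 module from the Python standard library as a stand-in for PostgreSQL, and the table and column names are invented for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (customer TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?)",
    [("alice", 120.0), ("bob", 40.0), ("alice", 80.0), ("carol", 15.0)],
)

# "Who are our best customers?" needs a GROUP BY, not a model
top_customers = conn.execute(
    """
    SELECT customer, SUM(amount) AS revenue, COUNT(*) AS n_orders
    FROM orders
    GROUP BY customer
    ORDER BY revenue DESC
    """
).fetchall()
conn.close()
```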

Practice / papers

(0) Interesting papers from CVPR towardsdatascience.com/the-10-coolest-papers-from-cvpr-2018-11cb48585a49

(1) Some down-to-earth obstacles to ML deploy habr.com/company/hh/blog/415437/

(2) Using synthetic data for CNNs (by Nvidia) - arxiv.org/pdf/1804.06516.pdf

(3) This puzzles me - so much effort and engineering spent on something ... strange and useless - taskonomy.stanford.edu/index.html

On paper they do a cool thing - investigate transfer learning between different domains, but in practice it is done on TF and there is no clear conclusion of any kind

(4) VAE + real datasets siavashk.github.io/2016/02/22/autoencoder-imagenet/ - only small Imagenet (64x64)

(5) Understanding the speed of models deployed on mobile - machinethink.net/blog/how-fast-is-my-model/

(6) A brief overview of multi-modal methods medium.com/mlreview/multi-modal-methods-image-captioning-from-translation-to-attention-895b6444256e

Visualizations / explanations

(0) Amazing website with ML explanations explained.ai/

(1) PCA and linear VAEs are close pvirie.wordpress.com/2016/03/29/linear-autoencoders-do-pca/

#deep_learning

#digest

#data_science

No, you don't need ML/AI. You need SQL

A while ago, I did a Twitter thread about the need to use traditional and existing tools to solve everyday business problems other than jumping on new buzzwords, sexy and often times complicated technologies.


snakers4 (Alexander), June 28, 07:43

2018 DS/ML digest 13

Blog posts / articles:

(0) Google notes on CNN generalization - goo.gl/XS4KAw

(1) Google on teaching robots in a virtual environment and then transferring models to reality - goo.gl/aAYCqE

(2) Google's object tracking via image colorization - goo.gl/xchvBQ

(3) Interesting articles about VAEs:

- A small intro into VAEs habr.com/company/otus/blog/358946/

- A small intuitive intro (super super cool and intuitive)

towardsdatascience.com/intuitively-understanding-variational-autoencoders-1bfe67eb5daf

- KL divergence explained

www.countbayesie.com/blog/2017/5/9/kullback-leibler-divergence-explained

- A more formal write-up arxiv.org/abs/1606.05908

- In (RU) habr.com/company/otus/blog/358946/

- Converting a FC layer into a conv layer cs231n.github.io/convolutional-networks/#convert

- A post by Fchollet blog.keras.io/building-autoencoders-in-keras.html
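
Since KL divergence is the part of VAEs that usually confuses people, here is a minimal sketch of both the discrete definition and the closed form for a diagonal Gaussian against a standard normal, which is the regularization term in the usual VAE loss. Function names are mine.

```python
import numpy as np

def kl_discrete(p, q):
    """KL(p || q) for two discrete distributions on the same support."""
    p = np.asarray(p, dtype=float)
    q = np.asarray(q, dtype=float)
    mask = p > 0                      # terms with p = 0 contribute nothing
    return float(np.sum(p[mask] * np.log(p[mask] / q[mask])))

def kl_gaussian_to_standard_normal(mu, log_var):
    """KL(N(mu, diag(exp(log_var))) || N(0, I)), the usual VAE penalty."""
    mu = np.asarray(mu, dtype=float)
    log_var = np.asarray(log_var, dtype=float)
    return float(-0.5 * np.sum(1.0 + log_var - mu ** 2 - np.exp(log_var)))

same = kl_discrete([0.5, 0.5], [0.5, 0.5])
different = kl_discrete([0.5, 0.5], [0.9, 0.1])
shifted = kl_gaussian_to_standard_normal([1.0, 0.0], [0.0, 0.0])
```

Note that KL is not symmetric: swapping p and q generally gives a different number.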

A good in-depth write-up on object detection:

- machinethink.net/blog/object-detection/

- finally a decent explanation of YOLO parametrization machinethink.net/images/object-detection/[email protected]

- best comparison of YOLO and SSD ever - machinethink.net/images/object-detection/[email protected]

Papers with interesting abstracts (just good to know such things exist)

- Low-bit CNNs - ai.intel.com/nervana/wp-content/uploads/sites/53/2018/06/ELQ_CameraReady_CVPR2018.pdf

- Automated Meta ML - arxiv.org/abs/1806.06927

- Idea - use ResNet blocks for boosting - arxiv.org/abs/1706.04964

- 2D-discrete-Fourier transform (2D-DFT) to encode rotational invariance in neural networks - arxiv.org/abs/1805.12301

- Smallify the CNNs - arxiv.org/abs/1806.03723

- BLEU review as a metric - conclusion - it is good on average to measure MT performance - www.mitpressjournals.org/doi/abs/10.1162/COLI_a_00322

"New" ideas in SemSeg:

- UNET + conditional VAE arxiv.org/abs/1806.05034

- Dilated convolutions for large satellite images arxiv.org/abs/1709.00179 - looks like this works only if you have high resolution with small objects

#digest

#deep_learning

How Can Neural Network Similarity Help Us Understand Training and Generalization?

Posted by Maithra Raghu, Google Brain Team and Ari S. Morcos, DeepMind In order to solve tasks, deep neural networks (DNNs) progressively...


snakers4 (Alexander), June 23, 12:10

Interesting links about Internet

- Ben Evans' digest - goo.gl/t9zG4y

- China plans to track cars - goo.gl/jeroFW

- Ben Evans - content is not king anymore - distribution / eco-system are goo.gl/ms2tQd

- Google opens AI center in Ghana - goo.gl/PRHBjq

- (RU) A funny case on censorship in Russia - funny article deleted from habr - sohabr.net/habr/post/414595/

-- It kind of clearly shows that you cannot safely post anything to habr

- India + WhatsApp + lynch mobs - goo.gl/tSBUCp

- Tor foundation about web-tracking and Facebook - goo.gl/H9DSuL

- Docker image jacking for crypto-mining - goo.gl/KrLLuQ

- Ethereum - 75% transactions automated bots - goo.gl/Q9BSNL

- (RU) - analyzing fake elections in Russia - 3-10M votes are fake - habr.com/post/358790/

#internet

2018 DS/ML digest 12

As usual, this is whatever I found really interesting / worth reading.

Implementations / papers / ideas

(0)

You can count bees well with UNet - matpalm.com/blog/counting_bees/

(1)

A really super cool idea - use affine transformations in 3D to stack augmentations on the level of transformation matrices

(3D augs are costly)

- gist.github.com/ematvey/5ca7df5d37c2f6a674390d42ef9e7d59

- both for rotation and scaling

- note a couple of things for easier understanding:

-- there is an offset in the transformations - because the coordinate center is not in the "center"

-- zoom essentially scales unit vectors after applying the offset

- 3Blue1Brown videos about linear algebra - www.youtube.com/watch?v=fNk_zzaMoSs
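
The matrix stacking described above can be sketched like this. It is my own minimal numpy version with homogeneous 4x4 matrices, not the code from the linked gist, and all function names are assumptions.

```python
import numpy as np

def translation(offset):
    m = np.eye(4)
    m[:3, 3] = offset
    return m

def rotation_z(angle_rad):
    c, s = np.cos(angle_rad), np.sin(angle_rad)
    m = np.eye(4)
    m[:2, :2] = [[c, -s], [s, c]]
    return m

def zoom(factor):
    m = np.eye(4)
    m[:3, :3] *= factor   # scales the unit vectors
    return m

def about_center(transform, shape):
    """Shift the origin to the volume center, transform, shift back."""
    center = (np.array(shape) - 1) / 2.0
    return translation(center) @ transform @ translation(-center)

shape = (64, 64, 64)
# Stack zoom and rotation into ONE matrix, so the volume is resampled only once
combined = about_center(rotation_z(np.pi / 2) @ zoom(1.5), shape)

center_point = np.array([31.5, 31.5, 31.5, 1.0])
corner_point = np.array([0.0, 0.0, 0.0, 1.0])
moved_center = combined @ center_point
moved_corner = combined @ corner_point
```

The center of the volume stays where it is, which is exactly what the offset is for. The combined matrix (or its inverse, depending on the resampling routine) can then be applied to the volume in a single interpolation pass.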

(2)

A top solution from Google's Landmark Challenge - goo.gl/pkZULZ

Essentially

- ensemble of features / skip connections from a CNN (ResNeXt)

- KNN

- use KNN + augment the extracted features by averaging with similar images

- query expansion (use the fact that different crops of the same landmark remain the same landmark)
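
A rough sketch of the retrieval tricks listed above, on random descriptors. The actual solution used CNN features, so the data, the values of k and the function names here are assumptions for illustration only.

```python
import numpy as np

def l2_normalize(x):
    return x / np.linalg.norm(x, axis=-1, keepdims=True)

def top_k(query, database, k):
    """Indices of the k most similar database rows (cosine similarity)."""
    sims = database @ query
    return np.argsort(-sims)[:k]

def augment_database(database, k):
    """Replace each descriptor by the mean of itself and its k nearest neighbours."""
    sims = database @ database.T
    neighbours = np.argsort(-sims, axis=1)[:, : k + 1]   # includes the row itself
    return l2_normalize(database[neighbours].mean(axis=1))

def query_expansion(query, database, k):
    """Average the query with its top-k results and search again."""
    first_pass = top_k(query, database, k)
    expanded = l2_normalize(query + database[first_pass].sum(axis=0))
    return top_k(expanded, database, k)

rng = np.random.default_rng(0)
database = l2_normalize(rng.normal(size=(100, 16)))
query = l2_normalize(database[7] + 0.05 * rng.normal(size=16))
results = query_expansion(query, augment_database(database, k=3), k=5)
```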

(3)

(RU) A super cool series about interesting clustering algorithms

- Affinity propagation

-- habr.com/post/321216/

-- www.icmla-conference.org/icmla07/FreyDueckScience07.pdf

- DBSCAN habrahabr.ru/post/322034/

- (spoiler - in practice use awesome HDBSCAN library)

(4)

Brief review of image super-resolution techniques

- habr.com/post/359016/

- In a nutshell, try in this order: FCN CNNs, auto-encoders with skip connections, or GANs

(5)

SOTA NLP by open-ai

blog.openai.com/language-unsupervised/

Key ideas

- Train a transformer language model on a large corpus in an unsupervised way

- Fine-tune on a smaller task

- Profit

Caveats

- "Our approach requires an expensive pre-training step - 1 month on 8 GPUs" (probably this should be discounted somewhat)

- TF and unreadable enterprise code

(6)

One more claimed SOTA word embedding set

allennlp.org/elmo

(7)

A cool github page by Sebastian Ruder to track major NLP tasks

github.com/sebastianruder/NLP-progress

Visualizations

(0)

Amazing visual explanations of how decision trees work

- www.r2d3.us/visual-intro-to-machine-learning-part-2/

- it explains visually how overfitting occurs in decision tree models

(1)

CIFAR T-SNE can be done in real-time on the GPU + tensorflow.js integration

- Blog goo.gl/Pk5Lq3

- Website goo.gl/1vpeFf

- Arxiv - arxiv.org/abs/1802.03680

- Demo - nicola17.github.io/tfjs-tsne-demo/

(2) Why people fail to use d3.js - goo.gl/hSt5dL

Datasets

(0) Nice idea - use available tools and videos to collect datasets

- goo.gl/HULsyH

- goo.gl/7AfRZZ

#digest

snakers4 (Alexander), June 12, 10:48

Interesting links about Internet

- Ben Evans' digest - goo.gl/7NkYn6

- Why it took so much time to create previews for Wikipedia - goo.gl/xg7N99

- Google postulating its AI principles? blog.google/topics/ai/ai-principles/

- Google product alternatives - goo.gl/RmA76N - I personally started to switch to more open-source stuff lately, but Docs and Android have no real options

- The future of ML in embedded devices - goo.gl/PjWpKj (sound ideas, but a post is by an evangelist)

- Yahoo messenger shutting down (20 years!) - goo.gl/uhomds - hi ICQ

- Microsoft Buys GitHub for $7.5 Billion - a16z write-up - goo.gl/3znstT

- NYC medallions dropped 5x in price - goo.gl/Vi7pG6

- JD covers villages in China with drone delivery already - goo.gl/bMGKSY

#digest

snakers4 (spark_comment_bot), June 06, 07:55

2018 DS/ML digest 11

Datasets

(0)

New Andrew Ng paper on radiology datasets

YouTube 8M Dataset post

As mentioned before - this is more or less blatant TF marketing

New papers / models / architectures

(0) Google RL search for optimal augmentations

- Blog, paper

- Finally Google paid attention to augmentations

- 83.54% top1 accuracy on ImageNet

- Discrete search problem, each policy consists of 5 sub-policies, each operation is associated with two hyperparameters: probability and magnitude

- Training regime - cosine decay for 200 epochs

- Top accuracy on ImageNet

- Best policy

- Typical examples of augmentations
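
A toy sketch of how such a policy could be applied to an image. The two operations are made-up stand-ins, not the operation set from the paper, and the policy has 2 sub-policies instead of 5 to keep it short.

```python
import numpy as np

# Toy operations on a float image in [0, 1]; magnitude is in [0, 1].
def brightness(img, magnitude):
    return np.clip(img + 0.5 * magnitude, 0.0, 1.0)

def invert(img, magnitude):
    return (1.0 - magnitude) * img + magnitude * (1.0 - img)

def apply_sub_policy(img, sub_policy, rng):
    """Apply each (operation, probability, magnitude) triple in order."""
    for op, prob, magnitude in sub_policy:
        if rng.random() < prob:
            img = op(img, magnitude)
    return img

def apply_policy(img, policy, rng):
    """Pick one sub-policy at random for every image."""
    sub_policy = policy[rng.integers(len(policy))]
    return apply_sub_policy(img, sub_policy, rng)

policy = [
    [(brightness, 0.8, 0.3), (invert, 0.2, 1.0)],
    [(invert, 0.5, 0.5), (brightness, 0.5, 0.1)],
]
rng = np.random.default_rng(0)
image = rng.random((8, 8))
augmented = apply_policy(image, policy, rng)
```

The search itself (finding good probabilities and magnitudes) is the expensive part and is not shown here.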

(1)

Training CNNs with less data

Key idea - with clever selection of data you can decrease annotation costs 2-3x

(2)

Regularized Evolution for Image Classifier Architecture Search (AmoebaNet)

- The first controlled comparison of the two search algorithms (genetic and RL)

- Mobile-size ImageNet (top-1 accuracy = 75.1% with 5.1M parameters)

- ImageNet (top-1 accuracy = 83.1%)

Evolution vs. RL at Large-Compute Scale

• Evolution and RL do equally well on accuracy

• Both are significantly better than Random Search

• Evolution is faster

But the proper description of the architecture is nowhere to be seen...

Libraries / code / frameworks

(0) OpenCV installation for Ubuntu18 from source (if you need e.g. video support)

News / market

(0) Idea adversarial filters for apps - goo.gl/L4Vne7

(1) A list of 30 best practices for amateur ML / DL specialists - forums.fast.ai/t/30-best-practices/12344

- Some ideas about tackling naive NLP problems

- PyTorch allegedly supports just freezing bn layers

- Also a neat idea I tried with inception nets - assign different learning rates to larger models when fine-tuning them

(2) Stumbled upon a reference to NAdam as an optimizer that is a bit better than Adam

It is also described in this popular article

(3) Barcode reader via OpenCV

#deep_learning

#digest

snakers4 (Alexander), June 05, 14:42

A very useful combination in tmux

You can resize your panes via pressing

- first ctrl+b

- hold ctrl

- press arrow keys several times while holding ctrl

...

- profit

#linux

#deep_learning

Digest about Internet

(0) Ben Evans Internet digest - goo.gl/uoQCBb

(1) GitHub purchased by Microsoft - goo.gl/49X74r

-- If you want to migrate - there are guides already - about.gitlab.com/2018/06/03/movingtogitlab/

(2) And a post on how Microsoft kind of ruined Skype - goo.gl/Y7MJJL

-- focus on b2b

-- lack of focus, constant redesigns, faltering service

(3) No drop in FB usage after its controversies - goo.gl/V93j2v

(4) Facebook allegedly employs 1200 moderators for Germany - goo.gl/VBcYQQ

(5) Looks like many Linux networking tools have been outdated for years

dougvitale.wordpress.com/2011/12/21/deprecated-linux-networking-commands-and-their-replacements/

#internet

#digest

snakers4 (spark_comment_bot), May 21, 06:21

2018 DS/ML digest 11

Cool thing this week

(0) ML vs. compute study since 2012 - chart / link

Market

(0) Once again about Google Duplex

(1) Google announcements from Google IO

-- Email autocomplete

We encode the subject and previous email by averaging the word embeddings in each field. We then join those averaged embeddings, and feed them to the target sequence RNN-LM at every decoding step, as the model diagram below shows.

-- Learning Semantic Textual Similarity from Conversations blog, paper. Something along the lines of Sentence2Vec, but for conversations, self-supervised, uses attention and embedding averaging

-- Google Clips device + interesting moment estimation on the device. Looks like MobileNet distillation into a small network with some linear models on top
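
The email autocomplete quote above is easy to picture in code. This is a minimal numpy sketch of the input side only (no RNN): random vectors stand in for trained word embeddings, and the token ids are arbitrary.

```python
import numpy as np

rng = np.random.default_rng(0)
vocab_size, emb_dim = 50, 8
embedding_table = rng.normal(size=(vocab_size, emb_dim))   # stand-in for trained embeddings

def average_embedding(token_ids):
    """Encode a text field as the mean of its word embeddings."""
    return embedding_table[token_ids].mean(axis=0)

subject_ids = np.array([3, 14, 15])
previous_email_ids = np.array([9, 2, 6, 5, 35])
context = np.concatenate(
    [average_embedding(subject_ids), average_embedding(previous_email_ids)]
)

# The same context vector is appended to the decoder input at every step
target_prefix_ids = np.array([1, 4, 1, 5])
decoder_inputs = np.stack(
    [np.concatenate([embedding_table[t], context]) for t in target_prefix_ids]
)
```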

Libraries / tools / papers

(0) SaaS NLP annotation tool

(1) CNNs allegedly can reconstruct low light images? Blog, paper, Looks cool AF

(2) Cool thing to try in a new project - postgres restful API wrapper - such things require a lot of care, but can eliminate a lot of useless work for small projects.

For my blog I had to write a simple business tier layer myself. I doubt that I could use this w/o overengineering, because, for example, I constructed open-graph tags in SQL queries

Job / job market

(0) (RU) Realistic IT immigration story

Datasets

(0) Last week open images dataset was updated. I downloaded the small one for the sake of images. Though the download process itself is a bit murky

#machine-learning

#digest

#deep-learning

snakers4 (spark_comment_bot), May 13, 11:25

2018 DS/ML digest 10

Market

(0) Some moonshots by Google in working with electronic health records

(1) Google duplex - a narrow domain bot that makes calls for you

(2) Nature wants to make its ML journal ... paid

(3) Stanford DAWNBench - training Imagenet encoders as quickly and cheaply as possible

(4) Facebook achieves 85% on Imagenet by training on 1bn images on 336 GPUs in a week

(5) Learning the models of the surrounding world based on a DOOM like game

Practice / libraries / code

(0) A smarter and new way to ensemble CNNs

- Traditional approach - ensemble CNNs with different architectures - and just vote / average / apply linear regression on top

- Newer approach - use Cyclic Learning rate

- Even newer approach - model snapshot ensembling

- Stochastic Weight Averaging

-- store running average of the models

-- train one model with CLR

-- at the end of each lr update (or epoch) - do a running average of the models with some weights

-- the gist of the method is located on this line

-- I do understand why they update bnorm params, but I do not understand why it cannot be done by just running 1 train epoch

- Papers on CNN ensembling 1 2 3

(1) (RU) Small amount of technical details, but face-detection + face hashing works in retail (+human operator) given an HD camera

(2) (RU) Pose estimation

(3) Numpy autograd
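
The running average at the heart of Stochastic Weight Averaging (item (0) in this list) is tiny. Here is a framework-free sketch where plain lists stand in for flattened model weights, with equal weights for every snapshot as an assumption.

```python
import numpy as np

class RunningWeightAverage:
    """Keeps an equal-weight running average of model weights."""

    def __init__(self):
        self.average = None
        self.n_models = 0

    def update(self, weights):
        weights = np.asarray(weights, dtype=float)
        if self.average is None:
            self.average = weights.copy()
        else:
            self.average = (self.average * self.n_models + weights) / (self.n_models + 1)
        self.n_models += 1
        return self.average

swa = RunningWeightAverage()
# Pretend these are weights captured at the end of each learning rate cycle
for snapshot in ([1.0, 2.0], [3.0, 4.0], [5.0, 6.0]):
    swa.update(snapshot)
```

With a real network the averaged weights are loaded back into the model, and the batch norm statistics are recomputed afterwards, since they were never estimated for the averaged weights.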

"New" papers worth mentioning

(0) SqueezeNext

- Module comparison

- Key changes

(i) more aggressive channel reduction by incorporating a two-stage squeeze module

(ii) separable 3 × 3 convolutions

(iii) element-wise addition skip connection similar to ResNet

- Performance

(1) GANs to generate full-body anime characters in different poses

Visualizations:

(0) (does not work in Firefox) Visualizing encoder-decoder networks for translation

#data-science

#deep-learning

#digest

Deep Learning for Electronic Health Records

Posted by Alvin Rajkomar MD, Research Scientist and Eyal Oren PhD, Product Manager, Google AI When patients get admitted to a hospital, th...


snakers4 (Alexander), May 10, 05:39

Internet

Interesting links about Internet

(0) Ben Evans goo.gl/gvNBhS

Russia / CIS

(0) Telegram has a new proxy setting in alpha, though no proper stand-alone solutions are published

t.me/dvachannel/21784

(1) Western media now cover Telegram

goo.gl/nPJ4Sm

Global / tech

(0) Xiaomi to file for an IPO - US$10 - US$100bn

(1) Yet another drag and drop ML that will (m?) fail - lobe.ai/ - this is so American

(2) Now all "major" apps heavily feature "stories" as main mobile format - goo.gl/wbnHYD

Yet another reason to quit all social media and just use professional apps / messaging

Add up all this bs => this is the reason normal people do not use social media for real now

(3) Tesla most shorted tech company now - goo.gl/11yndY xD

Figures

(0) YouTube - 1.8bn users with 1+ login goo.gl/kyXFDH

(1) WhatsApp m70bn messages per day (vs. 20bn max with SMS) goo.gl/67DdVn

#internet

#digest

snakers4 (Alexander), May 01, 16:52

2018 DS/ML digest 9

Market / libraries

(0) Tensorflow + Swift - wtf - goo.gl/FDvLM4

(1) Geektimes / Habrahabr.ru going international - goo.gl/dbGNwD

(2) A service for renting GPUs ... from people

- Reddit goo.gl/HxQ54x

- Link vectordash.com/hosting/

- Looks LXC based (afaik - the only user friendly alternative to Docker)

- Cool in theory, no idea how secure this is - we can assume as secure as providing a docker container to a stranger

- They did not reply to me in a week

(3) A friend sent me a list of ... yet another new PyTorch NLP libraries

- goo.gl/kasRfZ, goo.gl/XXnbJy (AllenNLP is the biggest library like this)

- I believe that such libraries are more or less useless for real tasks, but cool to know they exist

(4) New SpaceNet 4? goo.gl/CsSS6P

(5) A new super cool competition on Kaggle about particle physics? www.kaggle.com/c/trackml-particle-identification

Tutorials / basics

(0) Bias vs. Variance (RU) goo.gl/4Y7tH7

(1) Yet another magic Jupyter guideline collection - goo.gl/AFWMuq

Real world ML applications

(0) Resnet + object detection (RU) - people w/o helmets 90% accuracy - goo.gl/7xpQnE

(1) Fast.ai about using embeddings with Tabular data - www.fast.ai/2018/04/29/categorical-embeddings/

Very similar to our approach on electricity

I personally do not recommend using their library by any means

(2) Comparing Google TPU vs. V100 with ResNet50 - goo.gl/s6dhsy

- speed - goo.gl/Pww2sm

- pricing - goo.gl/Rtkp8Q

- but ... buying GPUs is much cheaper

(3) Other blog posts about embeddings + tabular data

- Sales prediction blog.kaggle.com/2016/01/22/rossmann-store-sales-winners-interview-3rd-place-cheng-gui/

- Taxi drive prediction blog.kaggle.com/2015/07/27/taxi-trajectory-winners-interview-1st-place-team-%F0%9F%9A%95/

MLP + classification + embeddings - goo.gl/AMNGNG / arxiv.org/pdf/1508.00021.pdf

(4) Albu's solution to SpaceNet - augmentations github.com/SpaceNetChallenge/RoadDetector/tree/master/albu-solution/src/augmentations

CNN overview

Neural network part:

Split data to 4 folds randomly but the same number of each city tiles in every fold

Use resnet34 as encoder and unet-like decoder (conv-relu-upsample-conv-relu) with skip connection from every layer of network. Loss function: 0.8*binary_cross_entropy + 0.2*(1 – dice_coeff). Optimizer – Adam with default params.

Train on image crops 512*512 with batch size 11 for 30 epochs (8 times more images in one epoch)

Train 20 epochs with lr 1e-4

Train 5 epochs with lr 2e-5

Train 5 epochs with lr 4e-6

Predict on full image with padding 22 on borders (1344*1344).

Merge folds by mean
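
The loss and the learning rate schedule quoted above, sketched in numpy. The soft (non-thresholded) dice and the eps smoothing are my assumptions, the original solution may compute dice differently.

```python
import numpy as np

def binary_cross_entropy(pred, target, eps=1e-7):
    pred = np.clip(pred, eps, 1.0 - eps)
    return float(-np.mean(target * np.log(pred) + (1.0 - target) * np.log(1.0 - pred)))

def dice_coeff(pred, target, eps=1e-7):
    intersection = np.sum(pred * target)
    return float((2.0 * intersection + eps) / (np.sum(pred) + np.sum(target) + eps))

def combined_loss(pred, target):
    """0.8 * BCE + 0.2 * (1 - dice), as quoted above."""
    return 0.8 * binary_cross_entropy(pred, target) + 0.2 * (1.0 - dice_coeff(pred, target))

def learning_rate(epoch):
    """Step schedule from the write-up: 20 / 5 / 5 epochs."""
    if epoch < 20:
        return 1e-4
    if epoch < 25:
        return 2e-5
    return 4e-6

target = np.array([[0.0, 1.0], [1.0, 1.0]])
good_pred = np.array([[0.1, 0.9], [0.8, 0.9]])
bad_pred = 1.0 - good_pred
```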

Jobs / job market

(0) Developers by country by scraping GitHub - goo.gl/n8gnLi

- developers count vs. GDP prntscr.com/j9v80e R^2 = 84%

- developers count vs. population - R^2 = 50%

Visualization

(0) Interactive tool for visualizing convolutions - ezyang.github.io/convolution-visualizer/

Datasets

(0) Open Images v4 open-sourced

- research.googleblog.com/2018/04/announcing-open-images-v4-and-eccv-2018.html

- the dataset itself storage.googleapis.com/openimages/web/download.html

- categories storage.googleapis.com/openimages/2018_04/bbox_labels_600_hierarchy_visualizer/circle.html

#data_science

#deep_learning

#digest

tensorflow/swift

swift - Swift for TensorFlow documentation repository.


snakers4 (Alexander), April 15, 08:06

2018 DS/ML digest 8

As usual my short bi-weekly (or less) digest of everything that passed my BS detector

Market / blog posts

(0) Fast.ai about the importance of accessibility in ML - www.fast.ai/2018/04/10/stanford-salon/

(1) Some interesting news about market, mostly self-driving cars (the rest is crap) - goo.gl/VKLf48

(2) US$600m investment into Chinese face recognition - goo.gl/U4k2Mg

Libraries / frameworks / tools

(0) New 5 point face detector in Dlib for face alignment task - goo.gl/T73nHV

(1) Finally a more proper comparison of XGB / LightGBM / CatBoost - goo.gl/AcszWZ (also see my thoughts here snakers41.spark-in.me/1840)

(3) CNNs on FPGAs by ZFTurbo

-- www.youtube.com/watch?v=Lhnf596o0cc

-- github.com/ZFTurbo/Verilog-Generator-of-Neural-Net-Digit-Detector-for-FPGA

(4) Data version control - looks cool

-- dataversioncontrol.com

-- goo.gl/kx6Qdf

-- but I will not use it - because proper logging and treating data as immutable solves the issue

-- looks like over-engineering for the sake of overengineering (unless you create 100500 datasets per day)

Visualizations

(0) TF Playground to see how the simplest CNNs work - goo.gl/cu7zTm

Applications

(0) Looks like GAN + ResNet + Unet + content loss - can easily solve simpler tasks like deblurring goo.gl/aviuNm

(1) You can apply dilated convolutions to NLP tasks - habrahabr.ru/company/ods/blog/353060/

(2) High level overview of face detection in ok.ru - goo.gl/fDUXa2

(3) Alternatives to DWT and Mask-RCNN / RetinaNet? medium.com/@barvinograd1/instance-embedding-instance-segmentation-without-proposals-31946a7c53e1

- Has anybody tried anything here?

Papers

(0) A more disciplined approach to training CNNs - arxiv.org/abs/1803.09820 (LR regime, hyper param fitting etc)

(1) GANs for image compression - arxiv.org/pdf/1804.02958.pdf

(2) Paper reviews from ODS - mostly moonshots, but some are interesting

-- habrahabr.ru/company/ods/blog/352508/

-- habrahabr.ru/company/ods/blog/352518/

(3) SqueezeNext - the new SqueezeNet - arxiv.org/abs/1803.10615

#digest

#data_science

#deep_learning

snakers4 (Alexander), April 07, 11:52

Internet digest

- Ben Evans - mailchi.mp/ben-evans/benedicts-newsletter-no-450525?e=b7fff6bc1c

- About autonomous cars - www.ben-evans.com/benedictevans/2018/3/26/steps-to-autonomy - autonomy will vary based on the route / conditions / situation / use case

- FB delays its speaker - www.bloomberg.com/technology

- Foxconn buys Belkin goo.gl/Xf6g9A

- Amazon music > 10m subs - goo.gl/C8Qhdm

- The Economist about ML in business - goo.gl/fTCHE9

- Apple to make its own chips - goo.gl/ZkkEVc

#internet

#digest

snakers4 (Alexander), March 30, 10:35

Internet digest

- Chrome OS on tablets - goo.gl/K5iCJw

- Facial recognition in China - goo.gl/aJjPH5 - 1984

- Ikea + AR manual - goo.gl/WW6Eqg

- WildBerries.ru stats - goo.gl/qPspe1

- Digital content forgery and ML - goo.gl/e5tqWa

- On Facebook tracking your SMS and calls

-- newsroom.fb.com/news/2018/03/fact-check-your-call-and-sms-history/

#digest

#internet

Google debuts Chrome OS tablets to take on the iPad in education

Ahead of Apple’s education-focused event tomorrow where a new affordable iPad is expected, Google this morning announced the first Chrome OS tablet. The Acer Chromebook Tab 10 is a new form f…


snakers4 (Alexander), March 20, 05:03

Internet / tech

(1) LIDAR - bridge technology www.ben-evans.com/benedictevans/2018/3/12/bridges

(2) VW to invest US$25bn in batteries goo.gl/yPrpUX

(3) Self-driving car kills a pedestrian goo.gl/Md3Cbs

(4) Terminal case of marketing bs - theranos - goo.gl/zNjZPL

(5) Spotify was a P2P app at first lol - goo.gl/e8riLc

(6) Stack Overflow survey 2018 - stackoverflow.blog/2018/01/08/take-2018-developer-survey/

Lol

(1) Prototype of small flying car - cora.aero

#digest

Bridges and LIDAR

A bridge product says 'of course x is the right way to do this, but the technology or market environment to deliver x is not available yet, or is too expensive, and so here is something that gives some of the same benefits but works now.'  Sometimes that’s a great business, and sometimes it


snakers4 (Alexander), March 13, 10:08

Internet digest

(1) Ben Evans - goo.gl/8f4RkE

Market

(1) Waymo launching pilot for the self-driving trucks - goo.gl/Bw2R9Q

(2) Netflix to spend US$8bn on ~700 shows in 2018 - goo.gl/6myKj6 (sic!)

(3) Intel vs Qualcomm and Broadcom - goo.gl/pa3iYB + Intel considering buying Broadcom - goo.gl/XP8fqd

(4) Amazon buys Ring - goo.gl/cnMw6o

(5) Latest darkmarket bust - Hansa - goo.gl/YcUxYD - it was not busted at once, but put under surveillance

- As with Silk Road - it all started with officials finding a server and making a copy of its hard drive

- This time - it was a dev server

- It contained ... owners' IRC accounts and some personal info

Internet + ML

(1) Netflix uses ML to generate thumbnails for its shows automatically - goo.gl/6poibk

- Features collected: manual annotation, meta-data, object detection, brightness, colour, face detection, blur, motion detection, actors, mature content
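Two of the features listed above (brightness and blur) are easy to compute per frame. Below is a minimal sketch, not Netflix's pipeline: the variance-of-Laplacian blur heuristic, the function names and the toy 5x5 frames are all assumptions made for illustration.

```python
from statistics import mean, pvariance

def brightness(img):
    """Mean pixel intensity of a grayscale frame given as a list of rows."""
    return mean(p for row in img for p in row)

def sharpness(img):
    """Variance of the 4-neighbour Laplacian over interior pixels.

    A common blur heuristic (an assumption here, not Netflix's method):
    low values suggest a flat or blurry frame.
    """
    h, w = len(img), len(img[0])
    lap = [
        img[y - 1][x] + img[y + 1][x] + img[y][x - 1] + img[y][x + 1]
        - 4 * img[y][x]
        for y in range(1, h - 1)
        for x in range(1, w - 1)
    ]
    return pvariance(lap)

# Hypothetical toy frames: a flat gray frame and a high-contrast checkerboard
flat = [[100] * 5 for _ in range(5)]
checker = [[255 if (x + y) % 2 == 0 else 0 for x in range(5)] for y in range(5)]

print(brightness(flat), sharpness(flat))
print(brightness(checker), sharpness(checker))
```

A frame ranker could then drop candidates whose sharpness falls below some threshold before looking at the more expensive features (faces, actors, etc.).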

#internet

#digest

Also also

(1) Dropbox - www.sec.gov/Archives/edgar/data/1467623/000119312518055809/d451946ds1.htm

(2) And Spotify www.sec.gov/Archives/edgar/data/1639920/000119312518063434/d494294df1.htm

filed for IPOs

#internet

snakers4 (Alexander), March 10, 13:59

Interesting / noteworthy semseg papers

In practice - UNet and LinkNet are the best and simplest solutions.

People rarely report that something like Tiramisu works properly.

Though in the last Konika competition I once saw a good solution based on DenseNet + a standard decoder.

So I decided to read some of the newer and older Semseg papers.

Classic papers

UNet, LinkNet - nuff said

(0) Links

- UNet - arxiv.org/abs/1505.04597

- LinkNet - arxiv.org/abs/1707.03718

Older, overlooked, but interesting papers

SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation

One of the original papers before UNet

(0) arxiv.org/abs/1511.00561

(1) Basically UNet w/o skip connections but it stores pooling indices

(2) SegNet uses the max pooling indices to upsample (without learning) the feature map(s) and convolves with a trainable decoder filter bank
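The pooling-index trick above can be sketched in plain Python. This is a toy single-channel version with 2x2 non-overlapping windows, not the SegNet code; the function names and the sample feature map are assumptions. The decoder's trainable convolution that follows the unpooling is omitted.

```python
def max_pool_with_indices(fmap, k=2):
    """k x k max pooling (stride k) that also remembers where each max was."""
    h, w = len(fmap), len(fmap[0])
    pooled, indices = [], []
    for y in range(0, h - k + 1, k):
        prow, irow = [], []
        for x in range(0, w - k + 1, k):
            window = [
                (fmap[y + dy][x + dx], (y + dy, x + dx))
                for dy in range(k)
                for dx in range(k)
            ]
            value, pos = max(window, key=lambda t: t[0])
            prow.append(value)
            irow.append(pos)
        pooled.append(prow)
        indices.append(irow)
    return pooled, indices

def max_unpool(pooled, indices, out_h, out_w):
    """Upsample without learning: put each value back at its stored position."""
    out = [[0] * out_w for _ in range(out_h)]
    for prow, irow in zip(pooled, indices):
        for value, (y, x) in zip(prow, irow):
            out[y][x] = value
    return out

# Hypothetical 4x4 feature map
fmap = [[1, 3, 2, 0],
        [4, 2, 1, 5],
        [0, 1, 7, 2],
        [2, 6, 3, 1]]

pooled, idx = max_pool_with_indices(fmap)
restored = max_unpool(pooled, idx, 4, 4)
print(pooled)    # encoder output
print(restored)  # sparse map that the decoder convolutions would densify
```

In PyTorch the same pair is available as `nn.MaxPool2d(..., return_indices=True)` and `nn.MaxUnpool2d`.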

ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation

Paszke, Adam / Chaurasia, Abhishek / Kim, Sangpil / Culurciello, Eugenio

(0) Link

- arxiv.org/abs/1606.02147

(1) Key facts

- Up to 18× faster, 75× fewer FLOPs, 79× fewer parameters vs SegNet or FCN

- Supposedly runs on NVIDIA Jetson TX1 Embedded Systems

- Essentially a mixture of ResNet and Inception architectures

- Overview of the architecture

-- goo.gl/M6CPEv

-- goo.gl/b5Kb2S

(2) Interesting ideas

- Visual information is highly spatially redundant, and thus can be compressed into a more efficient representation

- Highly asymmetric - decoder is much smaller

- Dilated convolutions in the middle => significant accuracy boost

- Dropout > L2

- Pooling operation in parallel with a convolution of stride 2, and concatenate resulting feature maps
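The last idea (pooling in parallel with a stride-2 convolution, then concatenation) can be sketched on a single-channel toy input. This is only an illustration: the kernels, the input and the function names are made up, and the real ENet block works on multi-channel tensors with learned filters.

```python
def conv3x3_stride2(img, kernel):
    """3x3 convolution, stride 2, zero padding 1 (single channel)."""
    h, w = len(img), len(img[0])

    def px(y, x):
        return img[y][x] if 0 <= y < h and 0 <= x < w else 0

    return [
        [
            sum(kernel[dy][dx] * px(y + dy - 1, x + dx - 1)
                for dy in range(3) for dx in range(3))
            for x in range(0, w, 2)
        ]
        for y in range(0, h, 2)
    ]

def maxpool2x2(img):
    """2x2 max pooling, stride 2 (assumes even height and width)."""
    h, w = len(img), len(img[0])
    return [
        [max(img[y][x], img[y][x + 1], img[y + 1][x], img[y + 1][x + 1])
         for x in range(0, w, 2)]
        for y in range(0, h, 2)
    ]

def downsample_block(img, kernels):
    """Run both branches on the same input and concatenate along channels."""
    conv_maps = [conv3x3_stride2(img, k) for k in kernels]
    return conv_maps + [maxpool2x2(img)]

# Hypothetical input and two hand-written kernels (identity and box filter)
img = [[1, 2, 3, 4],
       [5, 6, 7, 8],
       [9, 10, 11, 12],
       [13, 14, 15, 16]]
identity = [[0, 0, 0], [0, 1, 0], [0, 0, 0]]
box = [[1, 1, 1], [1, 1, 1], [1, 1, 1]]

channels = downsample_block(img, [identity, box])
print(len(channels), "channels of size", len(channels[0]), "x", len(channels[0][0]))
```

Both branches halve the resolution, so their outputs stack cleanly; the pooling branch adds channels at almost no compute cost.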

Newer papers

xView: Objects in Context in Overhead Imagery - new "Imagenet" for satellite images

(0) Link

- Will be available here xviewdataset.org/#register

(1) Examples

- goo.gl/JKr9wW

- goo.gl/TWRmn2

(2) Stats

- 0.3m ground sample distance

- 60 classes in 7 different parent classes

- 1 million labeled objects covering over 1,400 km2 of the earth’s surface

- classes goo.gl/v9CM5b

(3) Baseline

- Their baseline using SSD has very poor performance ~20% mAP

Rethinking Atrous Convolution for Semantic Image Segmentation

(0) Link

- arxiv.org/abs/1706.05587

- Liang-Chieh Chen / George Papandreou / Florian Schroff / Hartwig Adam

- Google Inc.

(1) Problems to be solved

- Reduced feature resolution

- Objects at multiple scales

(2) Key approaches

- Image pyramid (reportedly works poorly and requires a lot of memory)

- Encoder-decoder

- Spatial pyramid pooling (reportedly works poorly and requires a lot of memory)

(3) Key ideas

- Atrous (dilated) convolution - goo.gl/uSFCv5

- ResNet + Atrous convolutions - goo.gl/pUjUBS

- Atrous Spatial Pyramid Pooling block goo.gl/AiQZC1 - goo.gl/p63qNR
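A 1D toy version shows what the atrous (dilated) convolution does: the kernel taps are spread `rate` samples apart, so the receptive field grows without adding weights. This is a sketch with made-up names and data, not code from the paper.

```python
def atrous_conv1d(signal, kernel, rate=1):
    """'Valid' 1D convolution with kernel taps spaced `rate` samples apart."""
    k = len(kernel)
    span = (k - 1) * rate + 1  # effective receptive field of one output
    return [
        sum(kernel[i] * signal[start + i * rate] for i in range(k))
        for start in range(len(signal) - span + 1)
    ]

def effective_kernel_size(k, rate):
    """Receptive field of a size-k kernel with dilation `rate`."""
    return k + (k - 1) * (rate - 1)

signal = [1, 2, 3, 4, 5, 6, 7]
kernel = [1, 0, -1]

print(atrous_conv1d(signal, kernel, rate=1))  # ordinary convolution
print(atrous_conv1d(signal, kernel, rate=2))  # same 3 weights, wider view
print(effective_kernel_size(3, 2))
```

ASPP then simply runs several such convolutions with different rates in parallel on the same feature map and merges the results.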

(4) Performance

- As with the latest semseg methods, true performance boost is unclear

- I would argue that such methods may be useful for large objects

#digest

#deep_learning

snakers4 (Alexander), March 07, 05:03

2018 DS/ML digest 6

Visualization

(1) A new amazing post by Google on Distill - distill.pub/2018/building-blocks/.

This is really amazing work, but their notebooks tell me that it is a far cry from something the community can actually use - goo.gl/3c1Fza

This is how the CNN sees the image - goo.gl/S4KT5d

Expect this to be packaged as part of Tensorboard in a year or so)

Datasets

(1) New landmark dataset by Google - goo.gl/veSEhg - looks cool, but ...

Prizes in the accompanying Kaggle competitions are laughable goo.gl/EEGDEH goo.gl/JF93Xx

Given that datasets are really huge...~300G

Also, if you win, you will have to buy a ticket to the USA with your own money ...

(2) Useful script to download the images goo.gl/JF93Xx

(3) Imagenet for satellite imagery - xviewdataset.org/#register - pre-register

arxiv.org/pdf/1802.07856.pdf paper

(4) CVPR 2018 for satellite imagery - deepglobe.org/challenge.html

Papers / new techniques

(1) Improving RNN performance via auxiliary loss - arxiv.org/pdf/1803.00144.pdf

(2) Satellite imaging for emergencies - arxiv.org/pdf/1803.00397.pdf

(3) Baidu - neural voice cloning - goo.gl/uJe852

Market

(1) Google TPU benchmarks - goo.gl/YKL9yx

As usual such charts do not show consumer hardware.

My guess is that a single 1080Ti may deliver comparable performance (i.e. 30-40% of it) for ~US$700-1000, i.e. ~150 hours of rent (this is ~1 week!)

Miners say that 1080Ti can work 1-2 years non-stop

(2) MIT and SenseTime announce effort to advance artificial intelligence research goo.gl/MXB3V9

(3) Google released its ML course - goo.gl/jnVyNF - but generally it is a big TF ad ... Andrew Ng is better for grasping concepts
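The buy-vs-rent guess in Market item (1) above is simple arithmetic. A quick sketch, assuming a card price of roughly US$700-1000 and the ~US$6 per hour TPU beta price quoted elsewhere in these digests; `perf_ratio` is a made-up knob for "the GPU delivers only a fraction of the TPU throughput".

```python
def break_even_hours(hardware_cost_usd, rent_usd_per_hour, perf_ratio=1.0):
    """Hours of rent that cost as much as buying the hardware.

    perf_ratio < 1 means one owned-GPU hour replaces only that fraction
    of a rented hour, so the break-even point moves further out.
    """
    return hardware_cost_usd / (rent_usd_per_hour * perf_ratio)

TPU_RENT = 6.0  # US$ per hour (assumed from the beta price)

for cost in (700, 1000):
    hours = break_even_hours(cost, TPU_RENT)
    print(f"US${cost}: {hours:.0f} hours = {hours / 24:.1f} days of non-stop rent")
```

With a 30-40% performance ratio plugged in, the break-even stretches to a few weeks instead of one, which is still short compared to the 1-2 years of non-stop work that miners report.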

Internet

(1) Interesting thing - all ISPs have some preferential agreements between each other - goo.gl/sEvZMN

#digest

#data_science

#deep_learning

The Building Blocks of Interpretability

Interpretability techniques are normally studied in isolation. We explore the powerful interfaces that arise when you combine them -- and the rich structure of this combinatorial space.


snakers4 (Alexander), February 28, 10:40

Forwarded from Data Science:

Most common libraries for Natural Language Processing:

CoreNLP from Stanford group:

stanfordnlp.github.io/CoreNLP/index.html

NLTK, the most widely-mentioned NLP library for Python:

www.nltk.org/

TextBlob, a user-friendly and intuitive NLTK interface:

textblob.readthedocs.io/en/dev/index.html

Gensim, a library for document similarity analysis:

radimrehurek.com/gensim/

SpaCy, an industrial-strength NLP library built for performance:

spacy.io/docs/

Source: itsvit.com/blog/5-heroic-tools-natural-language-processing/

#nlp #digest #libs

Stanford CoreNLP

High-performance human language analysis tools. Widely used, available open source; written in Java.


snakers4 (Alexander), February 24, 05:56

2018 DS/ML digest 5

Fun stuff

(1) Hardcore metal + CNNs + style transfer - goo.gl/VHYfHe

SpaceNet challenge

(1) Post by Nvidia goo.gl/6Mw4CB

(2) Some links to sota semseg articles

(3) Useful tools for CV - floodfill and grabcut, but guys from Nvidia did not notice ... that road width was in geojson data...

(4) Looks like they replicated the results just for PR, but their masks do not look appealing

Research / papers / libraries

(1) Neural Voice Cloning with a Few Samples - goo.gl/LwmzRf (demos - audiodemos.github.io)

(2) A library for CRFs in Python - goo.gl/cQc8hA

(3) 1000x faster CNN architecture search - still on CIFAR - arxiv.org/pdf/1802.03268.pdf (PyTorch goo.gl/BZ9Vrh)

(4) URLs + CNN - malicious link detection - arxiv.org/abs/1802.03162

Datasets

(1) 3m anime image dataset - www.gwern.net/Danbooru2017

(2) Google HDR dataset - goo.gl/XEL1Fm

Market

(1) Idea - AMT + blockchain - goo.gl/JfzEPV

(2) ARM to make processors for CNNs? - goo.gl/MpdPSB

(3) Google TPU in beta - goo.gl/gRzq9t - very expensive. + Note the rumours that Google's own people do not use their TPU quota

(4) One guy managed to deploy a PyTorch model using ONNX - goo.gl/QD4DkZ

#digest

#machine_learning

#data_science

Hardcore Anal Hydrogen "Jean-Pierre" (2018, Apathia Records)

Order "Hypercut" : http://apathia.link/hah Bandcamp : https://hardcoreanalhydrogen.bandcamp.com/album/hypercut « A gigantic piece of art here to mess with wh...


snakers4 (Alexander), February 20, 04:40

Internet Digest

- Ben Evans - goo.gl/XsBqHN

- Flipboard (orly) launches ads - goo.gl/2muoiT

- Google sold 3.9 million Pixel phones in 2017 - goo.gl/6eUiXw

- Looks like smartbuses may be cool. App => bus route information => route gap => launch cosy bus with music and social features - goo.gl/TjKndB (I doubt this is a business though)

- About the importance of decentralization - next Internet will be a set of cryptonetwork protocols - goo.gl/c2aB4n

- How London is responding to technological innovation - goo.gl/Dh6NgD

(1) Connected and autonomous vehicles (CAVs) or driverless cars won't be on the road until the 2030s at least and could add to congestion

(2) Dockless cycle schemes need to be able to operate across London to be effective

(3) There is no control system in place for drones and droids

(4) TfL is monitoring technological developments but this needs to be embedded across the whole organisation

- Nice infographics about city dwellers' daily routes on pages 7-10 - goo.gl/vV71DR

#internet

#digest

snakers4 (Alexander), February 14, 11:48

2018 DS/ML digest 4

Applied cool stuff

- How Dropbox built their OCR - via CTC loss - goo.gl/Dumcn9
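For context on CTC: the network emits one label (or a blank) per frame, and the final text is obtained by merging repeated labels and then dropping blanks. The loss sums the probability of every frame-level path that collapses to the target. Below is a minimal sketch of the collapse rule only (not the loss itself and not Dropbox's code); the blank symbol and the sample path are assumptions.

```python
from itertools import groupby

BLANK = "-"  # assumed blank symbol

def ctc_collapse(frame_labels, blank=BLANK):
    """Map a per-frame label sequence to its output string.

    Step 1: merge runs of identical labels.
    Step 2: remove blanks (which is why a blank is needed between
    genuinely repeated characters, like the double 'l' below).
    """
    merged = (label for label, _ in groupby(frame_labels))
    return "".join(label for label in merged if label != blank)

print(ctc_collapse("hh-e-ll-lo--"))
```

This is also what greedy decoding does at inference time: take the argmax label per frame, then collapse.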

Fun stuff

- CNN forward pass done in Google Sheets - goo.gl/pyr44P

- New Boston Dynamics robot - opens doors now - goo.gl/y6G5bo

- Cool but toothless list of jupyter notebooks with illustrations and models modeldepot.io

- Best CNN filter visualization tool ever - ezyang.github.io/convolution-visualizer/index.html

New directions / moonshots / papers

- IMPALA from Google - DMLab-30, a set of new tasks that span a large variety of challenges in a visually unified environment with a common action space

-- goo.gl/7ASXdk

-- twitter.com/DeepMindAI/status/961283614993539072

- Trade crypto via RL - goo.gl/NmCQSY?

- SparseNets? - arxiv.org/pdf/1801.05895.pdf

- Use Apple watch data to predict diseases arxiv.org/abs/1802.02511?

- Google - Evolution in auto ML kicks in faster than RL - arxiv.org/pdf/1802.01548.pdf

- R-CNN for human pose estimation + dataset

-- Website + video densepose.org

-- Paper arxiv.org/abs/1802.00434

Google's Colaboratory gives free GPUs?

- Old GPUs

- 12 hours limit, but very cool in theory

- habrahabr.ru/post/348058/

- www.kaggle.com/getting-started/47096#post271139

Sick sad world

- China has police Google Glass with face recognition goo.gl/qfNGk7

- Why slack sucks - habrahabr.ru/post/348898/

-- Email + google docs is better for real communication

Market

- Globally there are 22k ML developers goo.gl/1Jpt9P

- One more AI chip moonshot - goo.gl/199f5t

- Google made their TPUs public in beta - US$6 per hour

- CNN performance comparable to human level in dermatology (R-CNN) - goo.gl/gtgXVn

- Deep learning is greedy, brittle, opaque, and shallow goo.gl/7amqxB

- One more medical ML investment - US$25m for cancer - goo.gl/anndPP

#digest

#data_science

#deep_learning

snakers4 (Alexander), February 13, 08:19

Internet digest

- Ben Evans - goo.gl/7e1M4H

- FB tried to buy Snapchat 2 times - for US$60m and US$3b - goo.gl/xUVAM1

- Allegedly some ML can achieve 85% diabetes prediction accuracy on apple watch sensor data - goo.gl/Jyz5fG

- Cars may embrace 48 volts instead of 12 volts - goo.gl/Xmq9W5

- Google reabsorbs Nest (read between the lines - it was successful) - goo.gl/TzbTtY

- Snap +70% revenue growth - goo.gl/CQM6Xn

- 7 of 8 USA top grocers participate in Instacart - goo.gl/CAmoqA

- Siri APIs are fragmented lol - goo.gl/D6vvMK

- Uber agreed to provide Waymo, the self-driving car unit under Google’s parent company, Alphabet, with 0.34 percent of its stock - goo.gl/uatWBx

#internet

#digest

snakers4 (Alexander), February 07, 09:58

Internet digest

- Ben Evans - goo.gl/VKLgma

- Ben Evans about smart home hype - goo.gl/jPrCEd

- Google closing Google Fiber - goo.gl/urftJc

- Amazon tracks warehouse slackers with wristbands - goo.gl/avtMyn

- Apple music overtaking Spotify - goo.gl/ghQ43p

- Why people like infinite scroll goo.gl/tp1XNV

- Netflix personalizes artwork - goo.gl/dF5hLL

- Self-driving trucks => more local trucking jobs goo.gl/tfaZSS

#internet

#digest

snakers4 (Alexander), February 01, 11:25

2018 DS/ML digest 2

Libraries

- One more RL library (last year saw 1 or 2) ray.readthedocs.io/en/latest/rllib.html

- Speech recognition from facebook - github.com/facebookresearch/wav2letter

- Even better speech generation than WaveNet - goo.gl/mTwyoV - I cannot tell the computer apart from a human

Industry (overdue news)

- Nvidia does not like its consumer GPUs being deployed in data centers goo.gl/n8mkxk

- Clarifai kills forevery goo.gl/PxcjvT

- Google search and gorillas vs. black people - goo.gl/t6LwLN

Blog posts

- Baidu - dataset size vs. accuracy goo.gl/j6M5ZP (log-scale)

-- goo.gl/AYan3f

-- goo.gl/JyVNHG

Datasets

- New YouTube actions dataset - arxiv.org/abs/1801.03150

Papers - current topic - meta learning / CNN optimization and tricks

- Systematic evaluation of CNN advances on the ImageNet arxiv.org/abs/1606.02228

-- prntscr.com/i8il35

- Training Deep Neural Networks on Noisy Labels with Bootstrapping arxiv.org/abs/1412.6596

-- prntscr.com/i8iq1p

- Cyclical Learning Rates for Training Neural Networks arxiv.org/abs/1506.01186

-- prntscr.com/i8iqjx

- Searching for Activation Functions - arxiv.org/abs/1710.05941

-- prntscr.com/i8l0sd

-- prntscr.com/i8l5dp

- Large batch => train Imagenet in 15 mins

-- arxiv.org/abs/1711.04325

- Practical analysis of CNNs

-- arxiv.org/abs/1605.07678
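Of the papers above, the cyclical learning rate one is the easiest to try: the "triangular" policy just moves the learning rate linearly between two bounds. A minimal sketch of that schedule; the bounds and step size below are illustrative values, not recommendations.

```python
import math

def triangular_lr(iteration, base_lr, max_lr, step_size):
    """Triangular cyclical learning rate.

    The rate climbs linearly from base_lr to max_lr over `step_size`
    iterations, then descends back over the next `step_size` iterations.
    """
    cycle = math.floor(1 + iteration / (2 * step_size))
    x = abs(iteration / step_size - 2 * cycle + 1)
    return base_lr + (max_lr - base_lr) * max(0.0, 1.0 - x)

# Illustrative settings (assumptions)
BASE_LR, MAX_LR, STEP = 0.001, 0.006, 2000

for it in (0, 1000, 2000, 3000, 4000):
    print(it, round(triangular_lr(it, BASE_LR, MAX_LR, STEP), 6))
```

In a training loop the returned value would be written into the optimizer's learning rate before every batch.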

#digest

#data_science

#deep_learning

snakers4 (Alexander), January 31, 07:14

Internet digest

- Ben Evans - goo.gl/XYKbvr

- RNNs + band names - goo.gl/LBBEiP

- Soldiers + fitness trackers = military bases - goo.gl/B4yzxX

- Google's new unit - security and ML - goo.gl/q1Xnjd

- Apple produces TV content - goo.gl/P2X9Gb

- Some bs rumours about Telegram ICO size - goo.gl/D4XgPD

- Twitter is plagued by bot-farms - goo.gl/ZLHVz1

-- Easy to detect via similar registration dates - goo.gl/ZLHVz1

- Podcast about financial innovations in the US - goo.gl/kxHUQY

#digest

#internet

Jeremy Fiance

recurrent neural network, trained on band names, generates fake @Coachella lineup - reminding us most band names are gibberish