Spark in me - Internet, data science, math, deep learning, philo

snakers4 @ telegram, 1365 members, 1673 posts since 2016

All this - lost like tears in rain.

Data science, deep learning, sometimes a bit of philosophy and math. No bs.

Our website
Our chat
DS courses review

snakers4 (Alexander), January 15, 08:33

First 2019 DS / ML digest

No particular highlights - just maybe ML industrialization vector is here to stay?




2019 DS/ML digest 01

2019 DS/ML digest 01 Статьи автора - Блог -

snakers4 (Alexander), January 10, 09:49

Someone implemented instance weighted CE loss for PyTorch


Pytorch instance-wise weighted cross-entropy loss

Pytorch instance-wise weighted cross-entropy loss. GitHub Gist: instantly share code, notes, and snippets.

snakers4 (Alexander), January 09, 09:15

Using nargs

Wrote about this a year ago.

Forgot about it, a friend reminded me.

You can pass lists to the python command line arguments.

parser.add_argument('--classifier_conf', default=[512, 2048, 5005], nargs='+', type=int)

and then just add params to your call as follows

--classifier_conf 512 2048 5005


snakers4 (Alexander), January 08, 03:12

Forwarded from Sava Kalbachou:

GitHub Free users now get unlimited private repositories

If you’re a GitHub user, but you don’t pay, this is a good week. Historically, GitHub always offered free accounts but the caveat was that your code had to be public. To get private repositories, you had to pay. Starting tomorrow, that limitation is gone. Free GitHub users now get unlimited private projects with up […]

snakers4 (Alexander), January 04, 04:08

Linux subsystem in Windows 10

It works and installs in literally 2 clicks (run one command in Powershell and then just one-click install your Linux distro of choice in Windows Store (yes, this very funny indeed))!

Why would you need this?

To make and backup files on one command for example =)

Something like this becomes reality on Windows:

cd /mnt/d/ && \

TIME=`date +%b-%d-%y` && \

FILENAME=working_files_tar-$TIME.tar.gz && \

INCREMENTAL_FILE=backup_data.snar && \

echo 'Using folderlist' $FOLDERS && \

tar -czg $(<folders_backup.txt) --listed-incremental=$INCREMENTAL_FILE --verbose -f $FILENAME

Also, you may add rsync or scp and you are good to go!

Also other potential use cases:

- You are somehow vendor locked (I depend on proprietary drivers for my thunderbolt port to attach an external GPU) or just are used to Windows' windows (or are just lazy to install Linux);

- You need one particular Linux program or you need to quickly test something / do not want to bother replicating your environment under Windows (yes, you can also run Docker, but there will be some learning curve);

- You run all of your programs remotely, and use your Windows machine as a thin client, but sometimes you need git / bash / rsync - i.e. to download movies from your personal NAS;


snakers4 (Alexander), December 31, 13:11

Happy holidays to everyone)

snakers4 (Alexander), December 30, 04:41

Spark in me 2018 annual retrospective


- My personal progress and some views;

- ML is still amazing, but there are no illusions anymore;

- Telegram is still amazing, but commercialization looms;

- FAIR is an inspiration;

- Imcinnes with UMAP and HDBSCAN as well;


Еще написал немного по-русски, немного со спецификой, если вам так удобнее



Spark in me - annual retrospective 2018

Spark in me - annual retrospective 2018 Статьи автора - Блог -

snakers4 (Alexander), December 29, 02:34

Yet another repo with all possible pre-trained imagenet models

Now on 4 frameworks...

Looks too good to be true



Sandbox for training large-scale image classification networks for embedded systems - osmr/imgclsmob

snakers4 (Alexander), December 29, 02:23

Environment setup for DS / ML / DL

Some time ago made a small guide for setting up an environment on a black Ubuntu machine.

If works both for CV and NLP.

If you like this, please tell me, I will add newer things:

- nvtop;

- CUDA10 with PyTorch 1.0;

- Scripts for managing GPU fan speed;




Contribute to snakers4/gpu-box-setup development by creating an account on GitHub.

snakers4 (Alexander), December 27, 04:54

snakers4 (Alexander), December 25, 18:37

Prevent NVIDIA GPUs' throttling on headless server

Prevent NVIDIA GPUs' throttling on headless server -

(My GPUs are ~70C under full load xD)

snakers4 (Alexander), December 25, 15:19

Practical creepiness

Now Google Photos explicitly shows that it knows faces of your family members.


snakers4 (Alexander), December 20, 12:12

Spell-checking on various scales in Russian

Bayes + n-gram rules = spell-checker for words / sentences


Исправляем опечатки в поисковых запросах

Наверное, любой сервис, на котором вообще есть поиск, рано или поздно приходит к потребности научиться исправлять ошибки в пользовательских запросах. Errare...

Neural Information Processing Systems

Welcome to NeurIPS 2018 Turorial Sessions. This tutorial on Visualization for Machine Learning will provide an introduction to the landscape of ML visualizaions, organized by types of users and their...

snakers4 (Alexander), December 19, 08:16

DS/ML digest 32


- A way to replace softmax in NMT;

- Large visual reasoning dataset;

- PyText;




2018 DS/ML digest 32

2018 DS/ML digest 32 Статьи автора - Блог -

snakers4 (Alexander), December 17, 13:18

A collaborative tool for discovering images.

snakers4 (Alexander), December 17, 09:24


- PyText from Facebook:

- TLDR - FastText meets PyTorch;

- Very similar to AllenNLP in nature;

- Will be useful if you can afford to write modules for their framework to solve 100 identical tasks (i.e. like Facebook with 200 languages);

- In itself - seems to be too high maintenance to use;

I will not use use it.




A natural language modeling framework based on PyTorch - facebookresearch/pytext

snakers4 (Alexander), December 15, 17:50

BigGANs: AI-Based High-Fidelity Image Synthesis
This episode was supported by "Anything outside life extension is a complete waste of time". See their papers: - Papers: www.ncbi.nlm.n...

snakers4 (Alexander), December 14, 19:10


NLP library build on top of PyTorch 1.0 by Facebook?

- No repo link though (

- The paper also mentions the same limited API as AllenNLP has ... =(


A natural language modeling framework based on PyTorch - facebookresearch/pytext

snakers4 (Alexander), December 14, 03:57

A Style-Based Generator Architecture for Generative Adversarial Networks
Paper (PDF): Authors: Tero Karras (NVIDIA) Samuli Laine (NVIDIA) Timo Aila (NVIDIA) Abstract: We propose an alternative generator a...

snakers4 (Alexander), December 10, 15:14

Forwarded from Админим с Буквой:

WireGuard — прекрасный VPN будущего?

Наступило время, когда VPN уже не является каким-то экзотическим инструментом бородатых сисадминов. Задачи у пользователей разные, но факт в том, что VPN стал...

snakers4 (Alexander), December 10, 04:27

Simpsons paradox

Nice explanation


Simpson’s Paradox and Interpreting Data

The challenge of finding the right view through data

snakers4 (Alexander), December 09, 07:59

DS/ML digest 31

Highlights of the week:

- PyTorch 1.0 released;

- Drawing with GANs;

- BERT explained;




2018 DS/ML digest 31

2018 DS/ML digest 31 Статьи автора - Блог -

snakers4 (Alexander), December 08, 05:17

PyTorch 1.0 release

View Release:



Tensors and Dynamic neural networks in Python with strong GPU acceleration - pytorch/pytorch

snakers4 (Alexander), December 07, 03:06

This AI Learns Human Movement From Videos
The paper "Towards Learning a Realistic Rendering of Human Behavior" is available here: Pick up cool perks on our Patre...

snakers4 (Alexander), December 06, 09:04

Painting with GANs

This looks just awesome.

I guess it will not work in real resolutions yet.


Painting with GANs from MIT-IBM Watson AI Lab

This demo lets you modify a selection of meaningful GAN units for a generated image by simply painting.

snakers4 (Alexander), December 05, 14:35

This kind of mirrors my own old post

Эпистемологическая ответственность

Почему мы должны нести ответственность за идеи, в которые верим и которыми пользуемся Статьи автора - Блог -

Forwarded from Вастрик.Пынь:

💌 Вастрик.Инсайд #37: Этикет, грамотность и обжорство в информационном поле

Давно хотелось поговорить на тему осознанного потребления информации и организации личного инфополя. Почему самоограничения и разборчивость — хорошо, но удалять все аккаунты и уходить в лес — не очень хорошая крайность. Если вы сами не организуете своё инфополе, это сделают за вас.

В конце выпуска есть формочка для вопросов на итоговый выпуск. Воспользуйтесь.

Вастрик.Инсайд #37

Этикет, грамотность и обжорство в информационном поле

snakers4 (Alexander), December 02, 09:40

A cheeky ML/DS themed sticker pack for our channel

Thanks to @birdborn for his art.

You are welcome to use it:

If you would like to contribute / create your own stickers - please ask around in our channel chat.


snakers4, November 30, 11:01

Channel Edit Photo

snakers4 (Alexander), November 29, 08:10

Article about the reality of CV in Russia / CIS


Also a bit on how to handle various types of "customers", who want to contract CV systems from you.

Warning - too much harsh reality)


snakers4 (Alexander), November 28, 11:55

DS/ML digest 30




2018 DS/ML digest 30

2018 DS/ML digest 30 Статьи автора - Блог -