Maria Khalusova
@mariaKhalusova 4 years, 7 months ago 830 views

To my JVM friends looking to explore Machine Learning techniques - you don’t necessarily have to learn Python to do that. There are libraries you can use from the comfort of your JVM environment. 🧵👇

https://t.co/EwwOzgfDca: Deep Learning framework in Java that supports the whole cycle: from data loading and preprocessing to building and tuning a variety of deep learning networks.

https://t.co/J4qMzPAZ6u Framework for defining machine learning models, including feature generation and transformations, as directed acyclic graphs (DAGs).

https://t.co/9IgKkSxPCq a machine learning library in Java that provides multi-class classification, regression, clustering, anomaly detection and multi-label classification.

https://t.co/EAqn2YngIE : TensorFlow Java API (experimental)

https://t.co/7TY0viBfF5: ML algorithms, feature preprocessing and pipelines. Scalable through distributed computations.

https://t.co/9EVdIXwJuo: The toolkit for common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, coreference resolution, language detection and more!

https://t.co/AnxgGmsux2: distributed linear algebra framework and mathematically expressive Scala DSL designed to let mathematicians, statisticians, and data scientists quickly implement their own algorithms.

https://t.co/fiexCElwRp: Statistical Machine Intelligence and Learning Engine: classification, regression, clustering, association rule mining, feature selection, manifold learning, multidimensional scaling, genetic algorithms, missing value imputation, nearest neighbor search...

https://t.co/kDGCjszAaA Kotlin∇ is a type-safe automatic differentiation framework in Kotlin. It allows users to express differentiable programs with higher-dimensional data structures and operators.

(Not yet released) automatic differentiation system for the Kotlin language: https://t.co/9ANDDIVW8o

https://t.co/jKeboC2z0V open-source, high-level, engine-agnostic Java framework for deep learning. DJL is designed to be easy to get started with and simple to use for Java developers.

https://t.co/pXkvxumzrw - a set of simple, scalable and efficient tools that allow the building of predictive Machine Learning models without costly data transfers.

More from Data science

Simon DeDeo
@SimonDeDeo

On Bayesianism, the Many Worlds Interpretation, and personal identity.

Some thoughts worked out in a letter to a friend, which is the kind of thing you do when off Twitter for a glorious week. (🧵)

“Chance is ignorance”—the Bayesian story; all probabilities represent states of mind, not states of the world. One *could* put (some) chances “in the world”, but let’s take Occam’s Razor seriously...

That the probability of a fair coin coming up heads is 50% simply means that marginalizing (tracing, as the physicists say) over the hidden facts leaves you, nearly, maximally ignorant of the outcome.

Quantum uncertainty (access below!) poses an apparent challenge to this story. There seems to be nothing to be ignorant about when it comes to (say) electron spin—there is nothing “inside” the

The electron is a simple object, in other words. So where does the uncertainty come from? One could follow David Wallace’s wonderful interpretation in terms of chaotic dynamics and decoherence, but let’s see if we can take another route...

Greg Yang
@TheGregYang

1/ An ∞-wide NN of *any architecture* is a Gaussian process (GP) at init. The NN in fact evolves linearly in function space under SGD, so is a GP at *any time* during training. https://t.co/v1b6kndqCk With Tensor Programs, we can calculate this time-evolving GP w/o training any NN

2/ In this gif, narrow relu networks have high probability of initializing near the 0 function (because of relu) and getting stuck. This causes the function distribution to become multi-modal over time. However, for wide relu networks this is not an issue.

3/ This time-evolving GP depends on two kernels: the kernel describing the GP at init, and the kernel describing the linear evolution of this GP. The former is the NNGP kernel, and the latter is the Neural Tangent Kernel (NTK).

4/ Once we have these two kernels, we can derive the GP mean and covariance at any time t via straightforward linear algebra.
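
As a hedged sketch of what that linear algebra can look like: assume squared loss and continuous-time gradient descent with learning rate η (assumptions not stated in the thread). Write K for the NNGP kernel, Θ for the NTK, X and Y for the training inputs and targets, and Θ for Θ(X, X) when no arguments are given. The symbols are notation introduced here for illustration, and the expressions follow from the linearized dynamics f_t(x) = f_0(x) − A_t(x)(f_0(X) − Y) with f_0 drawn from GP(0, K); they are not taken from the thread itself.

```latex
\begin{aligned}
A_t(x) &= \Theta(x, X)\,\Theta^{-1}\bigl(I - e^{-\eta \Theta t}\bigr), \\
\mu_t(x) &= A_t(x)\,Y, \\
\Sigma_t(x, x') &= K(x, x') + A_t(x)\,K(X, X)\,A_t(x')^{\top}
                 - A_t(x)\,K(X, x') - K(x, X)\,A_t(x')^{\top}.
\end{aligned}
```

At t = 0 this gives A_0 = 0, so the mean is zero and the covariance is the NNGP kernel K, matching the "GP at init" statement in 1/.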

5/ So it remains to calculate the NNGP kernel and NT kernel for any given architecture. The first is described in https://t.co/cFWfNC5ALC and in this thread

Research Engineering a...
@turinghut23

✨✨ BIG NEWS: We are hiring!! ✨✨
Amazing Research Software Engineer / Research Data Scientist positions within the @turinghut23 group at the @turinginst, at Standard (permanent) and Junior levels 🤩

👇 Here below a thread on who we are and what we

We are a highly diverse and interdisciplinary group of around 30 research software engineers and data scientists 😎💻 👉 https://t.co/KcSVMb89yx #RSEng

We value expertise across many domains - members of our group have backgrounds in psychology, mathematics, digital humanities, biology, astrophysics and many other areas 🧬📖🧪📈🗺️⚕️🪐
https://t.co/zjoQDGxKHq
/ @DavidBeavan @LivingwMachines

In our everyday job we turn cutting edge research into professionally usable software tools. Check out @evelgab's #LambdaDays 👩‍💻 presentation for some examples:

We create software packages to analyse data in a readable, reliable and reproducible fashion and contribute to the #opensource community, as @drsarahlgibson highlights in her contributions to @mybinderteam and @turingway: https://t.co/pRqXtFpYXq #ResearchSoftwareHour

Pat Schloss
@PatSchloss

Wellll... A few weeks back I started working on a tutorial for our lab's Code Club on how to make shitty graphs. It was too dispiriting and I balked. A twitter workshop with figures and code:

When are you doing pie charts?
— #BlackLivesMatter (@surt_lab) October 13, 2020

Here's the code to generate the data frame. You can get the "raw" data from https://t.co/jcTE5t0uBT

Obligatory stacked bar chart that hides any sense of variation in the data

Obligatory stacked bar chart that shows all the things and yet shows absolutely nothing at the same time

STACKED Donut plot. Who doesn't want a donut? Who wouldn't want a stack of them!?! This took forever to render and looked worse than it should because coord_polar doesn't do scales="free_x".

Ryan J. Gallagher
@ryanjgallag

Tired of word clouds? Want to do better sentiment analysis? Not sure how to look at the words underneath your measures?

Our long overdue paper on generalized word shift graphs is finally here!
https://t.co/lIBXvbMJWX
https://t.co/vSL1REYT8V

So what are they?

1/n

If we have two texts, there are many ways we can compare them. Weighted averages are a particularly useful measure because they're flexible and interpretable

Proportions, Shannon entropy, the KLD, the JSD, and dictionary methods can all be written as weighted averages

2/n
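
A minimal sketch of the "weighted average" view, using only the Python standard library. Shannon entropy is the average of each word's surprisal, -log2(p), weighted by its relative frequency p, and a dictionary method is the average of per-word scores weighted the same way. The function names and the toy scores are hypothetical, chosen for illustration; this is not the paper's code.

```python
import math
from collections import Counter


def relative_frequencies(tokens):
    """Map each word to its share of the token count."""
    counts = Counter(tokens)
    total = sum(counts.values())
    return {word: n / total for word, n in counts.items()}


def shannon_entropy(tokens):
    """Entropy in bits: surprisal -log2(p) averaged with weights p."""
    probs = relative_frequencies(tokens)
    return sum(p * -math.log2(p) for p in probs.values())


def dictionary_average(tokens, scores):
    """Average score over the tokens that appear in the score dictionary."""
    scored = [t for t in tokens if t in scores]
    probs = relative_frequencies(scored)
    return sum(p * scores[word] for word, p in probs.items())


# Toy inputs (hypothetical scores on an arbitrary scale).
toy_scores = {"good": 8.0, "bad": 2.0}
entropy_bits = shannon_entropy(["a", "a", "b", "b"])
mean_score = dictionary_average(["good", "good", "bad", "the"], toy_scores)
```

Both functions have the same shape, a sum of `weight * value` over words, which is what makes a per-word breakdown possible.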

But weighted avgs are also slippery. When we try to compress complex phenomena like happiness, surprise, divergence, or diversity into a single number, it can be unclear what we're measuring

If the measure goes up, what does that mean? Why did it do that? Can we trust it?

3/n

Very often, that's the end of the line and we're left with an uneasy feeling in the pit of our stomach that our weighted avg is actually picking up a data artifact or some other unintended peculiarity

Word shift graphs help us address those concerns

4/n

First, word shifts look under the hood of weighted averages to see what's going on

All weighted averages are a sum of contributions from individual words. We can pull out those words, and rank which ones contribute the most to the difference between two texts

5/n
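
The decomposition described above can be sketched in a few lines. This is a simplified illustration, assuming a dictionary-based weighted average where each word's contribution is its score times the change in its relative frequency between the two texts. The function names, toy texts, and scores are hypothetical, and the paper's generalized word shifts include further components not shown here.

```python
from collections import Counter


def relative_frequencies(tokens, vocabulary):
    """Relative frequency of each vocabulary word among the scored tokens."""
    counts = Counter(t for t in tokens if t in vocabulary)
    total = sum(counts.values())
    return {w: (counts[w] / total if total else 0.0) for w in vocabulary}


def word_shift(tokens_1, tokens_2, scores):
    """Rank words by how much they move the weighted average from text 1 to text 2."""
    p1 = relative_frequencies(tokens_1, scores)
    p2 = relative_frequencies(tokens_2, scores)
    # Per-word contribution: score times the change in relative frequency.
    contributions = {w: scores[w] * (p2[w] - p1[w]) for w in scores}
    return sorted(contributions.items(), key=lambda kv: abs(kv[1]), reverse=True)


# Toy example (hypothetical scores on an arbitrary scale).
scores = {"happy": 8.0, "sad": 2.0, "rain": 4.0}
text_1 = "happy happy rain sad".split()
text_2 = "sad sad rain happy".split()

ranked = word_shift(text_1, text_2, scores)
total_shift = sum(contribution for _, contribution in ranked)

for word, contribution in ranked:
    print(f"{word:>6} {contribution:+.2f}")
```

The contributions sum to the difference between the two texts' average scores, so the ranked list accounts for the whole change in the measure.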

You May Also Like

(っ◔◡◔)っ 💙 𝑀𝒶rymo Belfa...
@MarymoBelfast

THREAD PART 1.

On Sunday 21st June, 14 year old Noah Donohoe left his home to meet his friends at Cave Hill Belfast to study for school. #RememberMyNoah💙

He was on his black Apollo mountain bike, fully dressed, wearing a helmet and carrying a backpack containing his laptop and 2 books with his name on them. He also had his mobile phone with him.

On the 27th of June, Noah's naked body was sadly discovered 950m inside a storm drain, between access points. This storm drain was accessible through an area completely unfamiliar to him, behind houses at Northwood Road. https://t.co/bpz3Rmc0wq

"Noah's body was found by specially trained police officers between two drain access points within a section of the tunnel running under the Translink access road," said Mr McCrisken.

Noah's bike was also found near a house, behind a car, in the same area. It had been there for more than 24 hours before a member of public who lived in the street said she read reports of a missing child and checked the bike and phoned the police.

Khatvaanga
@khatvaanga

Sri Gurubhyo Namah.
Some tweets on Adi Shankaracharya and in particular the Guru Parampara of @sringerimath. [I had done this series earlier 5 years ago on account of Shishya Sweekaram].

Adi Shankara established the Sringeri Peetham as the 1st of 4 amnaaya peetams.

The four being
1. Sarada Peetam at Sringeri.
2. Govardhana Peetam at Puri
3. Kalika at Dwaraka
4. Jyothir at Badarikashramam.

He installed his 4 most prominent shishyas as heads of the peetams.

Sureshwaracharya was assigned to Sringeri as its head.

Sringeri is a corruption of Rshyashrunga giri.

Rshyashrunga maharshi [and his father Sri Vibandhaka Maharshi] did their tapas at this place.

It is at Sringeri that Shankara spent 12 of His precious 32 years!!

That gives you an idea how much He loved this place.

Rshyashrunga maharshi had a horn on his forehead, a result of being born to a deer. His mere presence was enough to shower rains!!

Rshyashrunga was the one who led the Putrakamesthi Yaagam of Dasaratha Maharaj, which resulted in Sri Rama's birth!!

Pulp Librarian
@PulpLibrarian

Due to the pandemic you may not have visited your local library in a while. So come with me on a virtual library tour, courtesy of stock photography, to see what we do for a living...

Libraries are of course information resource centres, but in many ways they are so much more. To get the best out of them you need to really know your way around the stacks.

The enquiries desk is normally your first stop in a library, and this is where you will meet The Angry Librarian! Why is she angry? Because you keep asking her stupid questions!

"Are you open?"
"Do you have a toilet?"
"That chair's wobbly!"
"Why isn't it available in audiobook?"
"Someone else is on the computer and that's not fair!"

On and on it goes...

And that's why in the library we insist on silence. It's the only way to stop us swearing at all the idiotic things you ask us. And we've looked up a lot of old swear words: beardsplitter, bescumber, rantallion, smellfungus etc. We're such muckspouts...

Liz (Welsh) Beechinor ...
@LizTweetsTech

My dad led an organization of over 300,000 employees.

When I took on my biggest marketing & events team to date (35 employees) I asked him how the *heck* he did it. Here's what he said and what helped me every day. 🧵

Remember you're leading all the employees under you, but you're not *managing* them all. You're managing your direct reports, which should be 5-7 individuals max. Focus on them.

When you interact with the employees you don't manage directly, get to know them as people first, that'll be the most valuable information in leading them. Remembering everything you learn about them is hard, but doing so will make you a superhero.

If you're not going to empower the managers under you, why do you have them? If you see room for improvement, let the manager know and let them make the change on their teams and come to you with questions/concerns. This will save you SO much time.

If you're the smartest person in any room - you're doing it wrong. As a leader, your job is bringing together the best people to get the job done - your skillset is identifying those people, not being one of them.

Anshul Pandey
@Anshulspiritual

IMPORTANCE OF SATYA NARAYAN KATHA AND ITS ORIGIN.

We all have heard and read the Satyanarayan katha. But none of us know the original stories that Shatanand or Sadhu or Kalavati must have heard. Also, as a habit we hear or read this katha

but don't know the proper way it should be conducted.

We find its mention in Skanda puraan (Reva khand) and Bhavishya Puraan (Pratisarg parva).
Both Bhagwatam and Gita call Bhagwan Narayan as Satv or Truth. So Satyanarayan denotes this name as the one which is Truth incarnate.

The story began in Naimisharanya when Vyasji asked Sutji about a way to eliminate all the sorrows and fulfill all the desires and about who is the devta who can grant these at the same time.

So Sutji first prayed to Bhagwan Satyanarayan who is no one else but Vishnu ji.

Then he remembered all his avatars. After that he narrated the story of Narad. Once while visiting Mrityulok, Naradji was pained to see the sufferings of people around. He was deeply moved by these sufferings and wanted to know the means by which we could rid ourselves of these.

So he went to Vishnu ji and after paying his respects stood in front of him. Naradji narrated the purpose of his visit.

So Bhagwan Vishnu said to him that in Satyug and Tretayug, Bhagwan eliminates sorrow in Vishnuswarup. In Dwapar he assumes different forms to help people.