Sophie Hill
@sophie_e_hill 3 years, 5 months ago 707 views

This is a more wonky thread about how I made this visualization in #Rstats using the awesome visNetwork

First step is to create the underlying network data. We need one file of "nodes" - i.e. the people and organizations. And one file of "edges" - i.e. the connections between them.
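As a rough illustration of the two-file layout described above, here is a minimal Python sketch (the thread itself works in R). The column names and every row are hypothetical placeholders, not the thread's actual data.

```python
import csv
import io

# Hypothetical rows: names and columns are illustrative, not the thread's data.
NODES_CSV = """id,label,type
1,Person A,person
2,Firm B,firm
3,Party C,political
"""

EDGES_CSV = """from,to,detail
1,2,Person A is a director of Firm B
2,3,Firm B donated to Party C
"""


def read_table(text):
    """Parse CSV text into a list of row dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))


nodes = read_table(NODES_CSV)
edges = read_table(EDGES_CSV)

# Every edge endpoint should refer to a node id that exists.
node_ids = {row["id"] for row in nodes}
dangling = [e for e in edges if e["from"] not in node_ids or e["to"] not in node_ids]
print(len(nodes), "nodes,", len(edges), "edges,", len(dangling), "dangling edges")
```

Checking for dangling edges is worth doing when both files are written by hand, since a mistyped id silently drops a connection.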

I created these by hand, based on excellent investigative journalism:

Now we can pull these together to create a network visualization!

You'll notice that I included a column for "type" in the nodes file. This allows me to use different icons for people vs firms vs political organizations.

All the icons are taken from @fontawesome. I *think* the visNetwork 📦 currently only works with fontawesome version 4.7, which is a bit limited – e.g. I decided to use a book icon to represent the fringe Evangelical Christian sect "Exclusive Brethren"! 😂
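The "type" column can drive the icon choice with a simple lookup. This is a Python sketch of the idea only: the type names and the particular icon names are my own picks from the Font Awesome 4.7 set, not necessarily the ones used in the visualization.

```python
# Illustrative mapping from node type to a Font Awesome 4.7 icon name.
# The type names and icon choices are assumptions made for this sketch.
ICON_BY_TYPE = {
    "person": "user",
    "firm": "building",
    "political": "university",
}
DEFAULT_ICON = "question-circle"  # fallback for any type not listed above


def icon_for(node_type):
    """Return the icon name for a node type, or the fallback icon."""
    return ICON_BY_TYPE.get(node_type, DEFAULT_ICON)


nodes = [
    {"label": "Person A", "type": "person"},
    {"label": "Firm B", "type": "firm"},
    {"label": "Mystery C", "type": "unknown"},
]
for node in nodes:
    node["icon"] = icon_for(node["type"])
    print(node["label"], "->", node["icon"])
```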

I very much enjoyed getting to use the "incognito" icon to represent all the unknown donors that have funded Tory MP Owen Paterson's overseas jaunts!

The icons are also scaled by how many "edges" connect to each "node".

Unsurprisingly, this means that the UK government and the Conservative party emerge as the most connected nodes in this network!
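Scaling by the number of connections comes down to counting how many edges touch each node (its degree). A minimal Python sketch with made-up edges follows; in visNetwork the resulting count would typically be stored in the nodes' `value` column, which the package uses to scale node size.

```python
from collections import Counter

# Made-up edge list for illustration: each pair is (from, to).
edges = [
    ("gov", "firm_a"),
    ("gov", "firm_b"),
    ("party", "firm_a"),
    ("gov", "party"),
]


def degree(edge_list):
    """Count how many edges touch each node, ignoring direction."""
    counts = Counter()
    for source, target in edge_list:
        counts[source] += 1
        counts[target] += 1
    return counts


deg = degree(edges)
most_connected = max(deg, key=deg.get)
print(dict(deg), "most connected:", most_connected)
```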

The great thing about visNetwork 📦 is that it's SO easy to make this visualization interactive with #RShiny.

You can add pop-up boxes ("tool-tips") that show more information when the user hovers over a node or edge – perfect for linking to the original reporting that I used.
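A tooltip of this kind is just a small HTML string attached to each node or edge (visNetwork reads it from a `title` column). The sketch below builds one in Python; the helper name, wording, and example URL are hypothetical.

```python
from html import escape


def tooltip(description, source_url):
    """Build an HTML tooltip with a short description and a source link."""
    return '<p>{}</p><a href="{}" target="_blank">Source</a>'.format(
        escape(description),
        escape(source_url, quote=True),
    )


# Hypothetical example values, not taken from the thread's data.
example = tooltip("Firm B won a contract & later donated", "https://example.com/report")
print(example)
```

Escaping the description keeps characters such as `&` or `<` in hand-written notes from breaking the pop-up markup.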

Check out the full code and data on github! https://t.co/rWuCxbCnW3

More from Data science

Jeremy Howard
@jeremyphoward

An amazing new project from @bearpelican was just released: https://t.co/DBov6sZTVS. A beautiful design; you can auto-generate a melody from chords, chords from a melody, and more.

It's technically brilliant, combining BERT, seq2seq, and Transformer XL
https://t.co/jF3mO5aXiu

It's also a wonderful example of leveraging and customizing the fastai framework in a deep & thoughtful way.

Here's the full set of blog posts diving into this

Ryan J. Gallagher
@ryanjgallag

Tired of word clouds? Want to do better sentiment analysis? Not sure how to look at the words underneath your measures?

Our long overdue paper on generalized word shift graphs is finally here!
https://t.co/lIBXvbMJWX
https://t.co/vSL1REYT8V

So what are they?

1/n

If we have two texts, there are many ways we can compare them. Weighted averages are a particularly useful measure because they're flexible and interpretable

Proportions, Shannon entropy, the KLD, the JSD, and dictionary methods can all be written as weighted averages
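To make the "weighted average" framing concrete, here is a small Python sketch showing Shannon entropy as the average of each word's surprisal, -log2(p), weighted by its relative frequency p. The toy text and function names are assumptions for illustration only.

```python
import math
from collections import Counter


def word_probabilities(words):
    """Relative frequency of each word in a list of tokens."""
    counts = Counter(words)
    total = sum(counts.values())
    return {word: count / total for word, count in counts.items()}


def weighted_average(probs, scores):
    """Average of per-word scores, weighted by word probability."""
    return sum(p * scores[word] for word, p in probs.items())


text = "a a b c".split()  # toy text
probs = word_probabilities(text)

# Shannon entropy is the weighted average of each word's surprisal.
surprisal = {word: -math.log2(p) for word, p in probs.items()}
entropy = weighted_average(probs, surprisal)
print(entropy)  # 0.5*1 + 0.25*2 + 0.25*2 = 1.5 bits
```

Swapping `surprisal` for a sentiment dictionary turns the same function into a dictionary-based measure.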

2/n

But weighted avgs are also slippery. When we try to compress complex phenomena like happiness, surprise, divergence, or diversity into a single number, it can be unclear what we're measuring

If the measure goes up, what does that mean? Why did it do that? Can we trust it?

3/n

Very often, that's the end of the line and we're left with an uneasy feeling in the pit of our stomach that our weighted avg is actually picking up a data artifact or some other unintended peculiarity

Word shift graphs help us address those concerns

4/n

First, word shifts look under the hood of weighted averages to see what's going on

All weighted averages are a sum of contributions from individual words. We can pull out those words, and rank which ones contribute the most to the difference between two texts
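A simplified Python sketch of that decomposition: when both texts use the same word scores, the difference between their weighted averages is the sum over words of score times change in relative frequency, and the words can be ranked by the size of their term. This is only the most basic case, not the generalized form in the paper; the scores and texts are made up, and every word is assumed to have a score.

```python
from collections import Counter


def rel_freq(words):
    """Relative frequency of each word in a list of tokens."""
    counts = Counter(words)
    total = sum(counts.values())
    return {word: count / total for word, count in counts.items()}


def word_contributions(text1, text2, scores):
    """Per-word contribution to avg(text2) - avg(text1), largest first."""
    p1, p2 = rel_freq(text1), rel_freq(text2)
    vocab = set(p1) | set(p2)
    contrib = {w: scores[w] * (p2.get(w, 0.0) - p1.get(w, 0.0)) for w in vocab}
    return sorted(contrib.items(), key=lambda item: abs(item[1]), reverse=True)


# Made-up scores and texts for illustration.
scores = {"good": 8.0, "bad": 2.0, "day": 6.0}
text1 = ["bad", "bad", "day", "good"]
text2 = ["good", "good", "day", "bad"]

ranked = word_contributions(text1, text2, scores)
total_shift = sum(value for _, value in ranked)
for word, value in ranked:
    print(word, round(value, 3))
print("total shift:", total_shift)
```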

5/n

Emil Wallner
@EmilWallner

Tips for AI writers:

1. Spend 30% of your effort on skimming all student ML papers (e.g. Stanford NLP CS224n) from the past 3 years and prototype your favorites

The idea is everything. Pick an area you are interested in and ideally something that has a visual aspect to it

Most of my 'on the top of my mind' ideas were bad in retrospect. Skimming 100s of student papers will give you an overview of what's interesting.

Student papers are overlooked, easy to understand, and have good compute constraints.

2. Spend 30% of your effort on coding

Create an edge to the project. Apply it to something new and use FastAI or Keras to improve the accuracy by 5-30%.

3. Spend 30% writing an in-depth article

Have a north star article in terms of structure and quality. Find something that stretches you to your utmost capability. I used @copingbear’s Style transfer article:

4. Spend 10% marketing your project

Invest a week in studying the strategies to rank on sites like HN and Reddit, then use them. If you have an interesting result and a great article, you've done the hard work.

Greg Yang
@TheGregYang

1/ An ∞-wide NN of *any architecture* is a Gaussian process (GP) at init. The NN in fact evolves linearly in function space under SGD, so is a GP at *any time* during training. https://t.co/v1b6kndqCk With Tensor Programs, we can calculate this time-evolving GP w/o training any NN

2/ In this gif, narrow relu networks have high probability of initializing near the 0 function (because of relu) and getting stuck. This causes the function distribution to become multi-modal over time. However, for wide relu networks this is not an issue.

3/ This time-evolving GP depends on two kernels: the kernel describing the GP at init, and the kernel describing the linear evolution of this GP. The former is the NNGP kernel, and the latter is the Neural Tangent Kernel (NTK).

4/ Once we have these two kernels, we can derive the GP mean and covariance at any time t via straightforward linear algebra.
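As a hedged illustration of that "straightforward linear algebra", the Python sketch below tracks only the GP mean on the training inputs. It assumes squared loss, full-batch gradient descent, and a zero-mean GP at init, in which case the mean follows the linear update mean <- mean - lr * NTK * (mean - y). This is the standard linearized-training picture rather than anything taken from the thread; the kernel values are made up, and the covariance (which also needs the NNGP kernel) is omitted.

```python
def matvec(matrix, vector):
    """Multiply a square matrix (list of rows) by a vector."""
    return [sum(m * v for m, v in zip(row, vector)) for row in matrix]


def mean_after_steps(ntk, targets, learning_rate, steps):
    """GP mean on the training inputs after some gradient-descent steps.

    Assumes squared loss and a zero-mean GP at initialization.
    """
    mean = [0.0] * len(targets)
    for _ in range(steps):
        residual = [m - y for m, y in zip(mean, targets)]
        update = matvec(ntk, residual)
        mean = [m - learning_rate * u for m, u in zip(mean, update)]
    return mean


# Made-up 2x2 NTK on two training points, with made-up targets.
ntk = [[2.0, 0.5], [0.5, 1.0]]
targets = [1.0, -1.0]

for steps in (0, 10, 500):
    print(steps, mean_after_steps(ntk, targets, 0.1, steps))
```

Because the update is linear, the same result could be written in closed form with matrix powers; the loop is used here only to keep the sketch dependency-free.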

5/ So it remains to calculate the NNGP kernel and NT kernel for any given architecture. The first is described in https://t.co/cFWfNC5ALC and in this thread

Data Professor
@thedataprof

Cheat sheet that summarizes #DataScience in 10 pages
(Links in the comments below 👇)

2/ Link to the cheatsheet by Maverick

You May Also Like

Brianne Kimmel 💬...
@briannekimmel

Shocking fact: Millennial men are less likely to work than any other age and gender demographic in America.

Today, there are 500,000 young men missing from the U.S. workforce.

Research suggests video games & improved leisure tech play a role in the problem. 👇 Thread:

Following the 2007 to 2009 recession, 25 to 34 year old men exited high school with fewer middle-skill job opportunities than years prior.

During this time, we saw an increased number of men living with parents & choosing unemployment over lower paying jobs.

It's estimated that 24M millennials live w/ their parents.

1 in 4 living in their parents’ home neither go to school nor work.

What's more surprising? 9 in 10 who lived with their parents a year ago are still living there w/ no plans to leave.

Economists are calling millennial men a lost generation.

According to economist David Dorn:

“If you get to the point where you’re turning 30, you’ve never held a real job and you don’t have a college education, then it is very hard to recover at that point.”

Economists suggest this choosiness is a generational trait.

Forbes interview w/ a high school educated man:

"I’m very quick to get frustrated when people refuse to pay me what I’m worth."
“People feel that they have choice nowadays, and they

Adam McKay
@GhostPanther

1) My Stan Lee story happened when I was in 5th grade. I was visiting NYC with my Dad and had read the address for Marvel in the comics I read constantly. I begged my Dad to take me and to his credit he did. And guess what? It was just an office.

2) We were down in the lobby and I was kind of crushed. I don’t know what I expected, The Thing and the Yancy Street Gang to be sitting around smoking cigars? Anyway, my Dad was taking a beat to figure where we were going next and a guy came up to us.

3) He was wearing a white shirt and tie and said to Dad “Is he disappointed because the Marvel offices were just offices?” My Dad said yes and then the guy who had gray around his temples and a mustache said “hold on a second” and opened one of those office mailboxes with a key.

4) He then handed me a thick stack of EVERY SINGLE MARVEL COMIC COMING OUT THE NEXT MONTH. “Here you go. Keep reading Marvel comics” he said and then walked off. I left in a daze and about 15 minutes later it hit me “Gray around the temples, mustache... That was Stan Lee!”

5) Later when I wrote on the Ant Man movie I told Kevin Feige the story, the year, look of the guy etc and Kevin said “That’s exactly the kind of thing Stan would do and he would have been there then. That was him.” Rest In Peace Stan Lee and thank you for the comics.

Max Fagin
@MaxFagin

November is here, and that means a massive shift is coming. And by "massive" I am of course referring to the redefinition of the kilogram unit of mass that the world has been building up to for more than 100 years. Let me explain:

1/ I've had an unhealthy fascination with metrology (the study of measurement) ever since my 2nd year as a physics major when I took a class devoted to duplicating historic physics experiments, so please indulge me for going into heavy detail (get it?) about the kilogram.

2/ So what actually *defines* a unit of measurement? If you're American, you probably know a mile is 5280 feet and a foot is 12 inches and an inch is 2.54 centimeters etc. But where does this chain of definitions end? Is it turtles all the way down?
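The chain of definitions in this tweet can be followed all the way down to an SI unit with a few multiplications. A small Python sketch, using exact fractions so no rounding creeps in (the variable names are my own):

```python
from fractions import Fraction

# Conversion factors quoted in the tweet, kept as exact fractions.
FEET_PER_MILE = 5280
INCHES_PER_FOOT = 12
CM_PER_INCH = Fraction(254, 100)  # 2.54 cm per inch
CM_PER_METER = 100

# Follow the chain: mile -> feet -> inches -> centimeters -> meters.
inches_per_mile = FEET_PER_MILE * INCHES_PER_FOOT
cm_per_mile = inches_per_mile * CM_PER_INCH
meters_per_mile = cm_per_mile / CM_PER_METER

print(inches_per_mile)         # 63360
print(float(meters_per_mile))  # 1609.344
```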

3/ It's actually not! For all units (even the imperial units used in America) the answers all end with the Système International (SI) unit definitions established and maintained for over 100 years by the Bureau International des Poids et Mesures (BIPM) in France.

4/ At the base of this tower are the SI base units. Just 7 SI base units define every other unit in existence. They are:

Kilogram, kg (mass)
Meter, m (distance)
Second, s (time)
Kelvin, K (temp)
Ampere, A (electric current)
Candela, cd (luminous intensity)
Mole, mol (quantity)

Jaya_Upadhyaya
@Jayalko1

NARHARI AND BHAGWAN PANDURANG/VITTHAL

Once a goldsmith by the name of Narhari lived in Pandharpur. He excelled in his craft. He had vowed not to look at Pandurang or visit his mandir ever, as he was a Shiva Bhakt.
One day, a wealthy merchant visited him.

He wanted a gold girdle studded with precious gems, to be made for Prabhu Vitthal. Narhari refused to make it as it was for Vitthal.
After many requests, he agreed to make the girdle but asked the merchant to bring the measurement from the mandir as he himself would not go there

The measurement was given and he made the girdle. When the girdle was put on Vitthal, it was loose. Narhari had to shorten it. Now the girdle became tight. So Narhari was forced to take the measurement on his own. He had to go to the mandir but went blindfolded to avoid seeing Vitthal

In the Mandir, when the blindfolded Narhari touched Vitthal, he felt as if he was touching Shiva with matted hair, the moon, a snake around His neck and a Trishul in His hand. Ecstatic with joy, he removed the blindfold to see Shiva but he found the murti of Vitthal with Rukmini there.

He realised the truth that there was no difference between Vitthal and Shiva. Narhari at once fell at the feet of Panduranga. He then took the measurement of the waist of Vitthal and this time the ornament fit the Murti perfectly.

Narayan Hari🙏🏻

Dr. Jane Clare Jones
@janeclarejones

So, on the subject of bonkers hyperbolic pretzeling over the Bell judgement, Grace 'destroy books I don't like & make inappropriate jokes about sterilising teenage girls' Lavery has some thoughts.

Tell me why my feminism is wrong Grace.

Oh

Well, if anyone thought the Bell judgment was going to make TRAs reconsider making massively overblown claims with no evidence backed up with nothing but a thick wadge of emotional blackmail.... HAHAHA, no one thought that.

A high court in the UK made a delimited judgment about teenagers' ability to consent to puberty blockers. This puts all trans people everywhere in the world at risk.

Because if any human anywhere has any thoughts that deviate in any way from the rote line dictated by the trans rights movement, this puts all trans people everywhere in mortal danger.

Let's be honest, Grace. It doesn't put trans people at risk. It puts trans ideology at risk. Because trans ideology depends on the idea of innate gender identity, and the trans child is the necessary material evidence of the ontology of gender identity.

That is, children are being medicalised to provide evidence to underwrite adults' identities.

Nothing to see here.