Negative Sampling

Question:
With 300 features and 10,000 words, how many weights exist in the hidden layers and output layers each?

The problem was resolved by two approaches

Subsampling frequent words to decrease the number of training examples.
Modifying the optimization objective with a technique called “Negative Sampling”, which causes each training sample to update only a small percentage of the model’s weights.

Both two not only reduced the compute burden of the training process, but also improved the quality of their resulting word vectors as well.

Subsampling rate

$w_i$ is the word, and $z(w_i)$ is the fraction.
$P(w_i)$ is the probability of keeping the word:
$P(w_i) = (\sqrt{\frac{z(w_i)}{0.001}} + 1) \cdot \frac{0.001}{z(w_i)}$

$P(w_i)=1.0$ (100% chance of being kept) when $z(w_i)<=0.0026$.
$P(w_i)=0.5$ (50% chance of being kept) when $z(w_i)<=0.00746$.
$P(w_i)=0.033$ (3.3% chance of being kept) when $z(w_i)<=1.0$.

Skip-gram neural network has a tremendous number of weights, all of which would be updated slightly by every one of our billions of training samples!
Negative sampling addresses this by having each training sample only modify a small percentage of the weights, rather than all of them.

Note: Alternative approach is Hierachical Softmax

How to do it?

Randomly select just a small number of “negative” words (let’s say 5) to update the weights.
5~20 words for small datasets; 2~5 words for large
Also still update the weights for our “positive” word
Only 0.06% of the 3M weights in the output layer!
The “negative samples” are selected using a “unigram distribution”
$P(w_i) = \frac{ {f(w_i)}^{3/4} }{\sum_{j=0}^{n}\left( {f(w_j)}^{3/4} \right) }$

Markdown support

Write content using inline or external Markdown. Instructions and more info available in the readme.

<section data-markdown>
  ## Markdown support

  Write content using inline or external Markdown.
  Instructions and more info available in the [readme](https://github.com/hakimel/reveal.js#markdown).
</section>

Transition Styles

You can select from different transitions, like:
None - Fade - Slide - Convex - Concave - Zoom

Themes

reveal.js comes with a few themes built in:
Black (default) - White - League - Sky - Beige - Simple
Serif - Blood - Night - Moon - Solarized

Pretty Code

function linkify( selector ) {
  if( supports3DTransforms ) {

    var nodes = document.querySelectorAll( selector );

    for( var i = 0, len = nodes.length; i < len; i++ ) {
      var node = nodes[i];

      if( !node.className ) {
        node.className += ' roll';
      }
    }
  }
}

Code syntax highlighting courtesy of highlight.js.

Marvelous List

No order here
Or here
Or here
Or here

Fantastic Ordered List

One is smaller than...
Two is smaller than...
Three!

Tabular Tables

Item	Value	Quantity
Apples	$1	7
Lemonade	$2	18
Bread	$3	2

Clever Quotes

These guys come in two forms, inline: The nice thing about standards is that there are so many to choose from and block:

“For years there has been a theory that millions of monkeys typing at random on millions of typewriters would reproduce the entire works of Shakespeare. The Internet has proven this theory to be untrue.”

Intergalactic Interconnections

You can link between slides internally, like this.

Speaker View

There's a speaker view. It includes a timer, preview of the upcoming slide as well as your speaker notes.

Press the S key to try it out.

Export to PDF

Presentations can be exported to PDF, here's an example:

Global State

Set data-state="something" on a slide and "something" will be added as a class to the document element when the slide is open. This lets you apply broader style changes, like switching the page background.

State Events

Additionally custom events can be triggered on a per slide basis by binding to the data-state name.

Reveal.addEventListener( 'customevent', function() {
	console.log( '"customevent" has fired' );
} );

Take a Moment

Press B or . on your keyboard to pause the presentation. This is helpful when you're on stage and want to take distracting slides off the screen.

Much more

THE END

- Try the online editor
- Source code & documentation

Word2vec

Word Embedding

Content

What is Word Embedding

Recall Bag Of Words

2 key advantages

2 ways to implement word2vec

CBOW vs Skip-Gram

Word2vec architect

Why Named Word2Vec

Output Layer

Contextual Similarity Visuliazation

Negative Sampling

GloVe

GloVe VS Word2Vec