Please Log
In for full access to the web site. Note that this link will take you to
an external site (https://petrock.mit.edu) to authenticate, and then you will be redirected
back to this page.
1) String Matching
Type a string that matches the following regex pattern ^Ba+.*n$
Loading...
Type a string that matches the following regex pattern ^Hello{5,} (W|w)orld
Loading...
When using the stream editor tool 'sed', what is the purpose of the syntax \(\) within a regular expression?
Loading...
Which component of a standard 'sed' command is responsible for ensuring that all instances of a pattern on a single line are replaced, rather than just the first one?
Loading...
Which 'awk' feature is used to define actions that should occur before any lines of the input file are processed?
Loading...
2) Similarity
What is the primary goal of applying a 'Stemming' algorithm like Porter's Algorithm in text processing?
Loading...
What is the Jaccard Similarity between S1 = {A, B, C, D} and S2 = {C,D,E}
Loading...
What is the Jaccard Similarity between S1 = {A, B, C, D} and S2 = {C, H, A, S, E}
Loading...
What is the Cosine similarity of the phrases: "The best team" and "The best team"
Loading...
According to the TF-IDF model, what happens to the importance of a term if it appears in almost every document in a large corpus?
Loading...
3) Embeddings
What properties apply to Skip-gram? (Select all that apply)
Loading...
What is the primary function of 'Negative Sampling' during the training of word embedding models?
Loading...
4) Applications, RAG, and Vector Databases
In the context of Large Language Models (LLMs), what problem does Retrieval Augmented Generation (RAG) primarily help to solve?
Loading...
What is a major advantage of Retrieval Augmented Generation (RAG) when using Large Language Models (LLMs)?
Loading...
How does Locality Sensitive Hashing (LSH) differ from traditional cryptographic hashing in vector databases?
Loading...
In Navigable Small World (NSW) graphs used for vector indexing, what is the role of 'long-range' links?