Project description

Reddit Reader

For any subreddit(s) you're interested in, search for relevant posts using keyword(s) and load the resulting text in the post and and top-level comments into LLMs/ LangChains.

Get your Reddit credentials ready

Visit Reddit App Preferences (https://www.reddit.com/prefs/apps) or https://old.reddit.com/prefs/apps/
Scroll to the bottom and click "create another app..."
Fill out the name, description, and redirect url for your app, then click "create app"
Now you should be able to see the personal use script, secret, and name of your app. Store those as environment variables REDDIT_CLIENT_ID, REDDIT_CLIENT_SECRET, and REDDIT_USER_AGENT respectively.
Additionally store the environment variables REDDIT_USERNAME and REDDIT_PASSWORD, which correspond to the credentials for your Reddit account.

Usage

LlamaIndex

from llama_index import VectorStoreIndex, download_loader

RedditReader = download_loader("RedditReader")

subreddits = ["MachineLearning"]
search_keys = ["PyTorch", "deploy"]
post_limit = 10

loader = RedditReader()
documents = loader.load_data(
    subreddits=subreddits, search_keys=search_keys, post_limit=post_limit
)
index = VectorStoreIndex.from_documents(documents)

index.query("What are the pain points of PyTorch users?")

LangChain

from llama_index import VectorStoreIndex, download_loader

from langchain.agents import initialize_agent, Tool
from langchain.llms import OpenAI
from langchain.chains.conversation.memory import ConversationBufferMemory

RedditReader = download_loader("RedditReader")

subreddits = ["MachineLearning"]
search_keys = ["PyTorch", "deploy"]
post_limit = 10

loader = RedditReader()
documents = loader.load_data(
    subreddits=subreddits, search_keys=search_keys, post_limit=post_limit
)
index = VectorStoreIndex.from_documents(documents)

tools = [
    Tool(
        name="Reddit Index",
        func=lambda q: index.query(q),
        description=f"Useful when you want to read relevant posts and top-level comments in subreddits.",
    ),
]
llm = OpenAI(temperature=0)
memory = ConversationBufferMemory(memory_key="chat_history")
agent_chain = initialize_agent(
    tools, llm, agent="zero-shot-react-description", memory=memory
)

output = agent_chain.run(input="What are the pain points of PyTorch users?")
print(output)

This loader is designed to be used as a way to load data into GPT Index and/or subsequently used as a Tool in a LangChain Agent. See here for examples.

Project details

These details have not been verified by PyPI

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

This version

0.1.3

Feb 21, 2024

0.1.2

Feb 13, 2024

0.1.1

Feb 12, 2024

0.1.0

Feb 10, 2024

0.0.1

Feb 4, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llama_index_readers_reddit-0.1.3.tar.gz (3.2 kB view hashes)

Uploaded Feb 21, 2024 Source

Built Distribution

llama_index_readers_reddit-0.1.3-py3-none-any.whl (3.4 kB view hashes)

Uploaded Feb 21, 2024 Python 3

Hashes for llama_index_readers_reddit-0.1.3.tar.gz

Hashes for llama_index_readers_reddit-0.1.3.tar.gz
Algorithm	Hash digest
SHA256	`9b0774d145818d5ed4b778067f355dd11c1264813383b2d9028086f74a692f88`
MD5	`ae0dde122a3d6f5a5da73e99c2a0f85a`
BLAKE2b-256	`861dd244c8d0b6359a88f1eaa2fa9a35ce55896986032bd56fc1a6550193b61b`

Hashes for llama_index_readers_reddit-0.1.3-py3-none-any.whl

Hashes for llama_index_readers_reddit-0.1.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`900154ce844cf3d0cf7d64a3bdfd3cd91b51ea75a3b546a6b2316c0775fdac88`
MD5	`12ca99f59b8a3d6d1f1eeb00aff80d64`
BLAKE2b-256	`8eef937ca52ccbbf53e8cbeea39706f45167a94fa0ffd2b038bb665b118d3e26`