Data pipeline to process my daily travel journal.
Set up the following environment variables:
export ALERT_WEBHOOK="Discord webhook url for logging."
export NOTES_BUCKET_NAME="Bucket storing the notes."
export TRAVEL_TABLE="Name of the DynamoDB table to store the output."
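A script that depends on these variables may want to fail early when one is missing. The following is a minimal sketch of such a check and does not reflect how the pipeline itself reads its configuration; `REQUIRED_VARS` and `missing_env_vars` are hypothetical names that do not come from this repository.

```python
import os

# The three variables listed above. The tuple name is hypothetical.
REQUIRED_VARS = ("ALERT_WEBHOOK", "NOTES_BUCKET_NAME", "TRAVEL_TABLE")


def missing_env_vars(environ=None):
    """Return the required variable names that are unset or empty.

    `environ` defaults to the process environment; passing a plain dict
    makes the check easy to exercise without touching real settings.
    """
    env = os.environ if environ is None else environ
    return [name for name in REQUIRED_VARS if not env.get(name)]


if __name__ == "__main__":
    missing = missing_env_vars()
    if missing:
        raise SystemExit("Missing environment variables: " + ", ".join(missing))
    print("All required environment variables are set.")
```

Running the file directly exits with a message naming any variable that still needs to be exported.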
git clone https://github.com/JFBarryLi/travel.git
python -m venv venv
. venv/bin/activate
pip install -e .
pytest -vv
docker build -t travel .
./scripts/notes_pipeline
from pipeline.notes.nlp import process
text = 'This is an example text.'
process(text)
Sentiment analysis uses arpanghoshal/EmoRoBERTa, a RoBERTa-based model trained on the GoEmotions dataset.
from pipeline.notes.nlp import predict_sentiment
text = 'This is an example text.'
predict_sentiment(text)
from pipeline.notes.geo_code import geo_code
geo_code('Lisbon')
First update geo_code_overrides.py
Then run:
./scripts/geo_code_overrides.py
Day 123
City_name - Thursday, February 10, 2022
Body of text. Body of text. Body of text.
Body of text.
Body of text. Body of text.
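The entry layout above (a `Day N` line, a `City - date` line, then free text) can be split into fields with a few lines of Python. This is only an illustrative sketch and may differ from what the pipeline does: `Entry` and `parse_entry` are hypothetical names, and the date format string assumes English day and month names as in the sample.

```python
import re
from dataclasses import dataclass
from datetime import date, datetime

# Matches the first line of an entry, e.g. "Day 123".
DAY_RE = re.compile(r"^Day (\d+)$")


@dataclass
class Entry:
    """Hypothetical container for one parsed journal entry."""
    day: int
    city: str
    entry_date: date
    body: str


def parse_entry(text):
    """Parse one entry laid out as: day line, 'City - date' line, body."""
    lines = text.strip().splitlines()
    if len(lines) < 2:
        raise ValueError("An entry needs a day line and a city/date line.")

    day_match = DAY_RE.match(lines[0].strip())
    if day_match is None:
        raise ValueError("First line must look like 'Day 123'.")

    # Split on the first ' - ' so the date keeps its own commas.
    city, separator, date_text = lines[1].partition(" - ")
    if not separator:
        raise ValueError("Second line must look like 'City - Weekday, Month D, YYYY'.")

    # Assumes English names, e.g. "Thursday, February 10, 2022".
    entry_date = datetime.strptime(date_text.strip(), "%A, %B %d, %Y").date()
    body = "\n".join(lines[2:]).strip()
    return Entry(int(day_match.group(1)), city.strip(), entry_date, body)
```

For example, `parse_entry("Day 123\nLisbon - Thursday, February 10, 2022\nBody of text.")` yields an `Entry` whose `city` is `"Lisbon"` and whose `day` is `123`.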
See LICENSE for more information.