Spaces:

Rogersurf
/

hrhub

Running

App Files Files Community

hrhub / SETUP_GUIDE.md

Roger Surf

Refactor: Professional Streamlit MVP

f15d7db 14 days ago

preview code

raw

history blame contribute delete

7.98 kB

A newer version of the Streamlit SDK is available: 1.52.1

Upgrade

🚀 HRHUB SETUP GUIDE

Quick Start Guide for Deployment

📦 What You Have

A complete, production-ready Streamlit application with:

✅ Professional code structure
✅ Mock data for MVP demo
✅ Interactive UI with network graphs
✅ Ready for GitHub + Streamlit Cloud deployment

⚡ OPTION 1: Quick Local Test (2 minutes)

For Mac/Linux:

cd hrhub
./run.sh

For Windows:

cd hrhub
run.bat

That's it! Open http://localhost:8501 in your browser.

🌐 OPTION 2: Deploy to Internet (10 minutes)

Step 1: Install Git (if not already)

Windows: Download from https://git-scm.com/
Mac: Install Xcode Command Line Tools
Linux: sudo apt install git

Step 2: Create GitHub Repository

Go to https://github.com/new
Repository name: hrhub
Keep it PUBLIC
Don't initialize with README (we have one)
Click "Create repository"

Step 3: Push Your Code

Open terminal/command prompt in the hrhub folder:

# Initialize git
git init

# Add all files
git add .

# Commit
git commit -m "Initial HRHUB MVP deployment"

# Connect to GitHub (replace YOUR-USERNAME)
git remote add origin https://github.com/YOUR-USERNAME/hrhub.git

# Push
git branch -M main
git push -u origin main

Step 4: Deploy on Streamlit Cloud

Go to https://share.streamlit.io
Click "Sign in" → Sign in with GitHub
Click "New app"
Fill in:
- Repository: YOUR-USERNAME/hrhub
- Branch: main
- Main file path: app.py
Click "Deploy!"

Wait 2-3 minutes and your app will be live! 🎉

You'll get a URL like: https://hrhub-YOUR-USERNAME.streamlit.app

🎯 Testing Your Deployment

What You Should See:

Header: "🏢 HRHUB - HR Matching System"
Demo Mode Banner: Blue info box saying mock data is active
Statistics: 4 metric cards showing:
- Total Matches: 10
- Average Score: ~65%
- Excellent Matches: 4
- Best Match: ~70%
Two Columns:
- Left: Candidate profile with expandable sections
- Right: Company matches (table or cards)
Network Graph: Interactive visualization at the bottom

Interaction Tests:

✅ Change slider in sidebar (matches 5-20)
✅ Change min score slider
✅ Switch view modes (Overview/Cards/Table)
✅ Expand candidate sections
✅ Hover over network graph nodes
✅ Drag nodes in the graph

🔧 Common Issues & Solutions

Issue 1: "streamlit: command not found"

Solution:

pip install streamlit

Issue 2: "Module not found"

Solution:

pip install -r requirements.txt

Issue 3: Port 8501 already in use

Solution:

streamlit run app.py --server.port 8502

Issue 4: Git push fails (authentication)

Solution:

Generate GitHub Personal Access Token:
- Settings → Developer settings → Personal access tokens → Generate new token
- Select "repo" scope
- Copy the token
When prompted for password, paste the token (not your GitHub password)

Issue 5: Streamlit Cloud deployment fails

Solution:

Check requirements.txt has all dependencies
Ensure app.py is in root directory
Check logs in Streamlit Cloud dashboard
Make sure repository is PUBLIC

📝 Next Steps (After Demo Works)

Phase 1: Generate Real Embeddings

Run your original code with save functionality:

import numpy as np
import pickle

# After generating embeddings...
np.save('candidate_embeddings.npy', candidate_embeddings)
np.save('company_embeddings.npy', company_embeddings)

with open('candidates_processed.pkl', 'wb') as f:
    pickle.dump(candidates, f)
    
with open('companies_processed.pkl', 'wb') as f:
    pickle.dump(companies_full, f)

Place files in hrhub/data/ folder

Phase 2: Create Real Data Loader

Create data/data_loader.py:

import numpy as np
import pickle
from utils.matching import find_top_matches

def load_embeddings():
    """Load pre-computed embeddings."""
    candidate_emb = np.load('data/candidate_embeddings.npy')
    company_emb = np.load('data/company_embeddings.npy')
    
    with open('data/candidates_processed.pkl', 'rb') as f:
        candidates = pickle.load(f)
    
    with open('data/companies_processed.pkl', 'rb') as f:
        companies = pickle.load(f)
    
    return candidate_emb, company_emb, candidates, companies

# Add functions matching mock_data.py structure

Phase 3: Swap Data Sources

In app.py, change:

# FROM:
from data.mock_data import get_candidate_data, get_company_matches

# TO:
from data.data_loader import get_candidate_data, get_company_matches

In config.py, change:

DEMO_MODE = False  # Turn off demo banner

That's it! The UI stays exactly the same.

🎓 For Your Teachers Demo

What to Show:

Start the app: Show the clean UI loading
Explain the candidate: "This represents a real data scientist profile"
Point out match scores: "70% means strong alignment"
Show the graph: "Green = candidate, Red = companies, thickness = match strength"
Demonstrate interaction: Drag nodes, zoom, hover
Highlight the concept: "No hardcoded rules - pure semantic similarity"

Key Points to Emphasize:

✅ Scalable: Works for 9.5K × 180K matching
✅ Fast: Real-time similarity computation
✅ Bilateral: Can work both directions
✅ No manual rules: NLP understands semantics
✅ Production-ready: Clean code, modular design

📊 Project Structure Explained

hrhub/
├── app.py              # Main app - teachers see this running
├── config.py           # Easy to tweak settings
├── requirements.txt    # All dependencies listed
│
├── data/
│   └── mock_data.py   # Demo data (swap later)
│
├── utils/
│   ├── matching.py    # Core algorithm - your innovation
│   ├── visualization.py  # Network graphs
│   └── display.py     # UI components
│
└── README.md          # Documentation

Why this structure?

Modular: Easy to swap mock → real data
Professional: Industry-standard layout
Maintainable: Clear separation of concerns
Scalable: Ready to add features

🎯 Timeline Suggestion

Tuesday (Today):

✅ Test locally: ./run.sh
✅ Deploy to GitHub
✅ Deploy to Streamlit Cloud
✅ Share link with team

Wednesday:

Run your original code
Generate & save embeddings
Test loading saved files

Thursday:

Create data_loader.py
Swap to real data
Test end-to-end
Fix any bugs

Friday:

✅ DEMO READY
Polish presentation
Prepare talking points

Weekend:

Focus 100% on report
App already deployed!

🆘 Need Help?

Quick Checks:

Is Python 3.8+ installed? python --version
Are dependencies installed? pip list | grep streamlit
Is the file structure correct? ls -la
Are you in the right directory? pwd

Still Stuck?

Check these in order:

Error message in terminal
Streamlit Cloud logs (if deployed)
GitHub Actions (if using)
This guide's "Common Issues" section

✅ Deployment Checklist

Before presenting to teachers:

Local test works: ./run.sh
Pushed to GitHub
Deployed on Streamlit Cloud
Can access via public URL
All sections display correctly
Graph is interactive
No error messages
Screenshot/video of working app
Link shared with team
Backup plan (run locally if cloud fails)

🎉 You're Done!

You now have:

✅ Professional codebase
✅ Working demo
✅ Online deployment
✅ Easy path to production

The hard part is done. Now focus on your report! 📝

Good luck with your presentation! 🚀

Questions? Check README.md for more details.