The Challenge
The Indonesia Stock Exchange (IDX) releases data on >5% ownership changes daily, but it is buried in fragmented PDF reports.
- The Friction: Investors currently have to download and open files one by one to check if major shareholders are accumulating or dumping stock.
- The Goal: Automate the extraction to visualize "Net Change" and accumulation trends instantly.
The Stack (Serverless Architecture)
- Ingestion: Google Cloud Functions (Python Scraper).
- Storage (Raw): Google Cloud Storage (GCS).
- Transformation: BigQuery (SQL Data Warehousing).
- Visualization: Looker Studio.
The Architecture
The Engineering Logic
I built a fully serverless pipeline to minimize cost and maintenance.
- Scrape: A Python script deployed on Cloud Functions triggers daily. It parses the IDX portal and downloads the raw reports.
- Store: Raw data is dumped immediately into Google Cloud Storage (Data Lake) for auditability.
- Transform: BigQuery ingests the raw files and runs SQL transformations to calculate the "Net Change" (Current Holdings - Previous Holdings).
- Visualize: The Looker Studio dashboard connects to the view, ready for consumption by D+1 morning.
The Insights (Value Provided)
- Net Change Tracker: Instantly identifies which Major Shareholder (e.g., Controllers, Institutions) is reducing or adding positions.
- Trend Analysis: Visualizes the accumulation vs. distribution patterns over time, helping investors spot "Smart Money" movements before the price reacts.
Links
- Live Dashboard: View on Looker Studio
- Code Repository: GitHub
- PDF Version:
Download File
Click to view file