PDF Download PDF

Andrea Matté

Backend Engineer

Rovereto, IT [email protected] LinkedIn GitHub

Summary

Backend engineer focused on performance-critical data systems, distributed storage, and AI-powered dynamic pricing. I build systems that need to be fast, scalable, and maintainable, often migrating from the legacy version.

Experience

Smartness — Backend Engineer

· Rovereto, IT

Backend engineer on Smartpricing, the AI-powered RMS inside the Smartness platform (4,000+ properties in Europe).

  • Rewrote the core pricing engine that produces all dynamic prices for 4,000+ hotels; migrated gradually with no downtime. Write-up: From custom Python per customer to a parameterized engine.
  • Built from scratch an event scraping and deduplication pipeline, using connected-components graph algorithms and distance-based geospatial filtering to keep stable identifiers across runs. Write-up: Designing a convergent event deduplication pipeline.
  • Co-developed the competitor price scraper, its APIs, and a ScyllaDB storage layer holding ~300 GB compressed at a ~100x compression ratio.
  • Led multiple legacy-to-modern migrations of backends and infrastructure with zero-downtime cutovers.

Selected project

Competitor Pricing — Compression-Friendly Storage Layer

ScyllaDB storage for scraped competitor prices, optimized for high compression and fast reads.

  • Keeps the full dataset online in ~300 GB, with ~100x shrink vs raw JSON and ~20x logical-to-physical compression in production, while serving ~100 QPS and cutting storage by 90% vs the previous format.
  • Designed the partition and clustering key layout for horizontal scale and compression locality, with a custom binary vector encoding for prices.
  • Built the ingestion path from the scraper into Scylla via batching, Kafka and stream processing, keeping write amplification low and absorbing bursts.
  • Technical deep dive: Achieving ~100x Compression on Scraped Pricing Data in ScyllaDB.

Technical skills

Languages
Python, TypeScript, SQL
Backend / APIs
FastAPI, Fastify, REST, async / event-driven services
Data / Storage
PostgreSQL, Redis, ScyllaDB, Cassandra, Kafka, Neo4j
Data analysis & visualization
Pandas, NumPy; interactive data visualization with Plotly, Seaborn, but also custom frontend+backend tools with Vue/Svelte and FastAPI/Fastify
Infra & Ops
Docker, Kubernetes, Helm, GCP, Grafana, CI/CD
Practices
Performance optimization, zero-downtime migrations, parameter- and data-driven design, observability

Education

Università di Trento — M.Sc. Computer Science, 110/110 cum laude

Master of Science in Computer Science and Information Technologies, with a specialization in Data Science.