Skip to main content

AI Data Engineer (Librarian)

Bilue
Sydney, NSW | Melbourne, VIC
hybrid
Full Time / Permanent

Bilue is a digital consultancy that designs and builds smart, user-friendly technology for some of Australia's most well-known businesses. From mobile apps to beautifully designed web platforms and digital experiences, we create solutions that drive impact and deliver exceptional customer outcomes.

Our culture is people-first and purpose-driven. We're a down-to-earth, values-led team with offices in Sydney and Melbourne, and a growing presence in Manila. We genuinely enjoy working together, whether we're solving tough tech problems, brainstorming creative solutions, or grabbing a coffee between meetings. Curiosity is encouraged. Collaboration is second nature. We value excellence, not ego, and back each other to do great work without micromanagement. With low politics and high trust, it's a place where delivery people, designers, and engineers genuinely connect, and where everyone has a voice, space to grow, and a little fun along the way.

The Role

AI systems don't fail because of bad models. They fail because of bad libraries, outdated documents cited as current, knowledge that exists but can't be found, datasets that are technically present but practically untrustworthy. The AI Data Engineer (Librarian) owns that problem.

This is not a traditional data engineering role. It sits at the intersection of data governance, knowledge management, and AI delivery. You will design and maintain the catalogues, metadata schemas, quality frameworks, and lineage structures that every AI system in The Foundry depends on, and you will work directly alongside AI Engineers to connect retrieval systems to the clean, context-aware knowledge stores you've built.

Think of it this way: the AI Engineer builds the system that searches the library. You build the library.

What You'll Do

  • Design and maintain data catalogues for AI projects using platforms such as DataHub, OpenMetadata, Apache Atlas, Collibra, or cloud-native equivalents (AWS Glue Data Catalog, Azure Purview, GCP Dataplex).
  • Define metadata schemas and taxonomy standards — type, version, jurisdiction, validity period, confidence tier — so retrieval systems know not just what a document is, but when it applies and how much to trust it.
  • Assess data quality across client and internal assets using tools like Great Expectations, dbt tests, or Soda; flag stale, superseded, or ambiguous records before they reach the AI layer.
  • Build and maintain data lineage so every AI-generated output can be traced back to its source, version, and validity period — making outputs auditable, not just accurate.
  • Design automated ingestion workflows and nightly quality checks that keep catalogues current without constant manual intervention.
  • Partner with The Foundry engineers to connect RAG pipelines and agentic retrieval systems to catalogue APIs, and with The Labs strategists to map knowledge needs to structured datasets.

What We're Looking For

  • A background in data governance, information management, knowledge management, or records management — with genuine interest in how that work enables AI delivery.
  • Hands-on experience with at least one data catalogue platform (DataHub, OpenMetadata, Collibra, Alation, Apache Atlas, or a major cloud equivalent) and familiarity with metadata standards such as JSON-LD, Dublin Core, or domain-specific ontologies.
  • Strong SQL skills; working Python for data profiling, quality scripting, and metadata automation; and comfort with dbt or OpenLineage for lineage tracking.
  • Understanding of vector databases and RAG architecture — enough to know how metadata quality directly affects retrieval precision.
  • Experience in regulated or high-stakes data environments where provenance and auditability genuinely matter: financial services, insurance, government, healthcare, or similar.
  • A collaborative, low-ego disposition. The Data Librarian's work is structural and often invisible. The glory goes to the AI system. You are fine with that.

Bonus: A formal background in library or information science. Experience with knowledge graphs (Neo4j, RDF, SPARQL). Prior consulting or agency delivery experience.

Life at Bilue

People-first focus: We're committed to delivering exceptional outcomes for our clients, but we know it starts with our people. You'll join a values-led team that's collaborative, curious, and genuinely cares about doing great work, together.

Connection that counts: From monthly anchor days and team lunches to our annual offsite, we create intentional moments to connect, collaborate, and celebrate. These aren't just fun perks — they're part of how we work and grow together.

Flexibility that works: We offer hybrid working, with 1–2 days per week in the office. It's a balance that gives you the space to do your best work, while still creating time to connect and build strong relationships in person.

Strong internal communities: We actively foster internal communities across tech, design, delivery, and beyond, giving you plenty of chances to connect, share knowledge, and learn from your peers.

Opportunities to grow: We invest in your development with unlimited access to Go1's learning library and support from our internal performance coach. Whether you want to deepen your technical skills or grow your leadership potential, we'll back you.

Flat structure, real impact: At Bilue, everyone's voice matters. Our leadership team is hands-on and approachable, and we operate without unnecessary layers. We keep things open and transparent, and your ideas will be heard, no matter your title.

Bilue = Big + Blue Ocean. Are you ready to set sail? Apply now!

NB. This is a full-time position based in Sydney, NSW or Melbourne, VIC. To be considered, candidates must have unrestricted working rights in Australia.

Apply for this job

Posted 2h ago

Engineer (SQL/ Python) - EmpowerUp26

Westpac Group
Sydney, NSW
hybrid
  • Support ADAPT platform users, analytics applications, and onboarding
  • Returning to work after career break - all experience levels welcome
  • SQL, Python, data processing, AI productivity tools, Azure
Posted 9d ago

Head of Data Delivery

Westpac Group
Sydney, NSW | Melbourne, VIC
hybrid
  • Lead enterprise data delivery transformation and AI-enabled delivery models
  • Deep experience delivering large-scale enterprise data platforms
  • Data delivery, platform engineering, AI-enabled execution, stakeholder mgmt
Posted 16d ago

Principal Data Engineer

CommBank
Sydney, NSW
hybrid
  • Lead regulatory reporting data platform solutions
  • Strong experience in data engineering field
  • AWS, Snowflake, ETL framework design, SQL expertise
Posted 20d ago