Sources
Sources
Bring any data you want.
Sources (or data sources) are critical components of an organization’s data integration strategy. They represent the origins of data that can be ingested, transformed, and utilized across various platforms and applications to meet diverse business needs. These sources can be a 3rd party API, SaaS, flat files, database, data warehouse, or a data lake.
What types of Sources exist?
Sources are systems, databases, files, or applications from which data is extracted. They can be broadly categorized into:
- Databases: Structured data from relational databases (e.g., MySQL, PostgreSQL) and NoSQL databases (e.g., MongoDB, Cassandra).
- Cloud Storage: Data stored in cloud services such as Amazon S3, Google Cloud Storage, and Azure Blob Storage.
- Applications: Data from enterprise applications like ERP systems, CRM platforms (e.g., Salesforce), and marketing tools.
- Flat Files: CSV, Excel, JSON, and XML files that contain structured or semi-structured data.
- Data Lakes and Warehouses: Centralized repositories like data lakes (e.g., AWS Lake Formation) and data warehouses (e.g., Snowflake, Google BigQuery) that store large volumes of data from multiple sources.
Which sources can I connect to?
We currently support the connection of a list of sources that you can check on the platform. If you need to connect to a different data source, not listed yet, please let us know via Slack and we’ll work on it for you.