City_Database_Report
City_Database_Report
Integration
Tasks to be Completed
1. Collect Data on Cities:
- All cities worldwide with a population exceeding 20,000 inhabitants.
- The data must be categorized by country and include standardized information:
- City name (in the local language and, if possible, in English).
- Country to which the city belongs.
- Population (if available).
- Latitude and longitude.
Technical Requirements
To complete this project successfully, the following steps are recommended:
1. Research Data:
- Use reliable sources such as open data databases (e.g., GeoNames, UN, or geographic data
APIs).
- Verify the accuracy of the data and organize it consistently.
2. Data Cleaning and Organization:
- Ensure that city names are standardized.
- Remove duplicates or irrelevant data (e.g., cities with less than 20,000 inhabitants).
- Structure districts hierarchically for major cities.
3. Deliverables:
- CSV/JSON files ready for easy import.
- Documentation on the data structure (e.g., column descriptions, relationships between
tables).
- If possible, a script or method for updating the database in the future.
Required Skills
This project requires the expertise of a data scientist or geographic data specialist, capable
of:
- Handling massive datasets.
- Automating the processing and classification of data.
- Delivering results that are exploitable and adapted to the technical needs of a website.
Conclusion
This database will be a key asset for the future development of my website, allowing for
increased customization and scalability. Your expertise is essential to ensure data quality
and compliance with my technical needs. Please send me your proposal and a quote for this
project.