Code:fao usda.csv: Difference between revisions

no edit summary
(Created page with "fao_usda.csv is a cross-reference table, to connect data from two sources: # FAO Primary Crop Production # USDA Nutrition Data, SR Legacy Purpose: to help analyze crop production in terms of calories, protein, etc. FAO data specifies crop production in tonnes, but for some crops, this includes some inedible parts. For example a ton of bananas includes the peels; a ton of apples includes the co...")
 
No edit summary
 
Line 1: Line 1:
fao_usda.csv is a cross-reference table, to connect data from two sources:
fao_usda.csv is a cross-reference table, to connect data from two sources:
# [[:File:data/primary/fao/crops_primary.csv|FAO Primary Crop Production]]
# [[:File:fao-crops-primary.csv|FAO Primary Crop Production]]
# [[:File:data/sr28-collated.csv|USDA Nutrition Data, SR Legacy]]
# [[:File:sr28-collated.csv|USDA Nutrition Data, SR Legacy]]
Purpose: to help analyze crop production in terms of calories, protein, etc.
Purpose: to help analyze crop production in terms of calories, protein, etc.


Line 332: Line 332:
[5] from "Hempseed in food industry: Nutritional value, health benefits, and industrial applications - "https://onlinelibrary.wiley.com/doi/full/10.1111/1541-4337.12517 : the Multiplier 0.5 is interred from the fact that measurements of protein, fat, and oil - in "whole seed" - are roughly at the midpoint between "dehulled seed" and "hemp hulls". Note: the hulls do in fact have some protein and oils, but the fibre content is so high, i'm counting them as inedible for humans. They may in fact have a place in the human diet. Note: btw, another data source is comparing protein and oil content of shelled and unshelled hemp seeds: https://www.aocs.org/stay-informed/inform-magazine/featured-articles/hempseed-oil-in-a-nutshell-march-2010 see image1 chart
[5] from "Hempseed in food industry: Nutritional value, health benefits, and industrial applications - "https://onlinelibrary.wiley.com/doi/full/10.1111/1541-4337.12517 : the Multiplier 0.5 is interred from the fact that measurements of protein, fat, and oil - in "whole seed" - are roughly at the midpoint between "dehulled seed" and "hemp hulls". Note: the hulls do in fact have some protein and oils, but the fibre content is so high, i'm counting them as inedible for humans. They may in fact have a place in the human diet. Note: btw, another data source is comparing protein and oil content of shelled and unshelled hemp seeds: https://www.aocs.org/stay-informed/inform-magazine/featured-articles/hempseed-oil-in-a-nutshell-march-2010 see image1 chart
[6] Linseed: USDA says flaxseed is 42% fat, but FAO [1] says 35%. Assume the discrepency is due to fao "linseed" data including hulls (flaxseed nutrition data does not). So let's use a Multiplier of 0.833 (=35%/42%)
[6] Linseed: USDA says flaxseed is 42% fat, but FAO [1] says 35%. Assume the discrepency is due to fao "linseed" data including hulls (flaxseed nutrition data does not). So let's use a Multiplier of 0.833 (=35%/42%)
[7] Oil palm fruit: To get the Multiplier, we look at a different FAO dataset: [[:File:data/primary/fao/crops-processed.csv]], which includes the production of palm oil (item code 257) and palm kernel oil (item code 256). Add those two together, and divide by the ''oil palm fruit'' of this dataset, and you get the Multiplier. Using the most recent 2 years where data is available (2018 and 2019), we get (71735061 + 74583225 + 7918019 + 8226464) / (409265212 + 415898058) = 0.1968856. For simplicity sake, we count all the oil as palm oil, even though a small percent is palm ''kernel'' oil. The latter has a bit more saturated fat, but the calories are the same.
[7] Oil palm fruit: To get the Multiplier, we look at a different FAO dataset: [[:File:fao-crops-processed.csv]], which includes the production of palm oil (item code 257) and palm kernel oil (item code 256). Add those two together, and divide by the ''oil palm fruit'' of this dataset, and you get the Multiplier. Using the most recent 2 years where data is available (2018 and 2019), we get (71735061 + 74583225 + 7918019 + 8226464) / (409265212 + 415898058) = 0.1968856. For simplicity sake, we count all the oil as palm oil, even though a small percent is palm ''kernel'' oil. The latter has a bit more saturated fat, but the calories are the same.
[8] Olives: USDA data is only available for canned olives. Such data shows 10% oil. But FAO [1] says olives are 22% oil. If we assume that the discrepency is entirely because the canned olives are diluted with saltwater... then we use a Multiplier of 2.2. But then again, olives intended for oil probably have more oil whereas olives for canning might have more starch and/or protein. So I'm lowering the Multiplier to 2.0.
[8] Olives: USDA data is only available for canned olives. Such data shows 10% oil. But FAO [1] says olives are 22% oil. If we assume that the discrepency is entirely because the canned olives are diluted with saltwater... then we use a Multiplier of 2.2. But then again, olives intended for oil probably have more oil whereas olives for canning might have more starch and/or protein. So I'm lowering the Multiplier to 2.0.
[9] Seed cotton is "55%-65%" seeds [2], and cottonseeds are 16% oil [1]. Thus, Multiplier = 60% * 16% = 9.6% = 0.096
[9] Seed cotton is "55%-65%" seeds [2], and cottonseeds are 16% oil [1]. Thus, Multiplier = 60% * 16% = 9.6% = 0.096